AD-780  115 


INTERACTIVE  SY ST E M S  R E SE A RC H 
M.  I.  Bernstein 

System  Development  Corporation 


Prepared  for: 

Advanced  Research  Projects  Agency 
30  April  1974 


DISTRIBUTED  BY: 


Nation!  Tachnical  Information  Senrico 
U.  S.  IfPARTMEL  OF  CCttMERCE 

5285  Port  Royal  Road,  Spri'igfidd  Va.  22151 


AD780115 


TM-5243/001/C 


INTERACTIVE  SYSTEMS  RESEARCH 
INTERIM  REPORT  TO  THE  DIRECTOR 
ADVANCED  RESEARCH  PROJECTS  AGENCY 

FOR  THE  PER  IOE 
1  OCTOBER  1973  TO  31  MARCH  1974 


30  APRIL  1974 


D  D  C 

1 


TNI*  ftCPOMT  WAS  MOOUCCO  •»  SOC  »N  Of  COHTNA(T  D  AMC  t  5  73  £  0010 

A»AA  0*OC«  NO  22S4  WOttAi.  C  OOC  *0  3030 


T'l 

ft 


„  ,aVI  A 
.  .  L  -4 


Reproduced  by 

NAT’ONAL  TECHNICAL 
INFORiVIAiION  SERVICE 

U  $  repartn-ent  of  Commorco 
f#pringf»eld  VA  22151 


System  Development  Corporation 

3500  Colorado  Avenue  ■  Santa  Monica,  California  90400 


Security  CUt»ific«tion _ _ _  _ 


DOCUMENT  CONTROL  DATA  -  R  &  D 

(Stcurltf  cSmttltlcmttmn  tf  till*,  body  of  mnd  in  v'..|  tmnotmtlon  mmt  bo  mntmrmd  uhwn  thm  owtulf  Wfoff  fo  clA*«l/l»f> 


J  REPORT  TITLE 

Interactive  Systeirs  Research:  Interim  Report  to  the  Director,  Advanced  Research 
Projects  Agency,  for  the  Period  1  October  1973  to  31  March  1974 


i  1  OHI01NATINC  activity  (Cor.or.lt  tuHtor)  [1*.  ritFOAT  ttCUAITT  CLAIIIFIC  ATION 

System  Development  Corporatior  |  UNCLASSIFIED 

2500  Colorado  Avenue 
Santa  Monica,  California  90406 


111.  GROUP 


t  ottCRiFTivt  NOTH  (Trp.  .1  rep  art  mri  Inc  tut  tv  dtl.c) 

Technical 

9  AUTHORISI  (Fttal  MM,  miild 7#  Initial,  fill 

M.  I.  Bernstein 

•  REPORT  OATE 

7m.  TOTAL  NO.  OF  RASES 

To.  NO.  OF  REPS 

30  April  1974 

-ii  r  </s~ 

M.  CONTRACT  OR  GRANT  NO 

N.  ORIGINATOR’S  REPORT  NUMSCRt*) 

DAHC15-73-C-0080 

A.  FROJtCT  NO 

3D30 

C. 

TM-5243/001/00 

U.  OTMEN  MtFOAT  NOKI  (Anr  other  number.  #Mf  err  be  hiIai^ 

d . 

(Me  rmpmrt) 

to  distribution  statement 

Approved  for  public  release;  distribution  unlimited 


I 


JUPPLEMIN T  ART  NOTES 

12.  IPONIORING  MILITARY  ACTIVITY 

ARPA/Information  Processing  Techniques 

»J  ABSTRACT 


Reproduced  by 

NATIONAL  TECHNICAL 
INFORMATION  SERVICE 

U  S  Department  of  Commerce 
SpringfwkJ  VA  22151 


DD  ,fr.J473 


uncl;  SSIFIED 
Security  Clattification 


TM-5243/001/00 


INTERACTIVE  SYSTEMS  RESEARCH: 
INTERIM  REPORT  TO  THE  DIRECTOR, 
ADVANCED  RESEARCH  PROJECTS  AGENCY, 

FOR  THE  PERIOD 
1  OCTOBER  1973  TO  31  MARCH  1974 


30  APRIL  1974 


M.  I.  BERNSTEIN 
(213)  393-9411,  EXT  6117 


TMIS  It  t  PON  T  NAS  mOUUCIO  •  ▼  IDC  IN  KNPOHMANCI  Of  CONTRACT  O  A  NC  1  5  -73  -C  008  0 
anna  ON  DC  ■  NO  2254  N*o«r«M  COOC  NO.  3030 

▼MC  Views  Amo  conclusions  containco  nc  n  c  in  ari  THOSE  or  The  AUTHOR  ano  should 
NOT  NC  IHTERRRETEO  AS  NCCCSSANILV  RIRtEtlNTlHC  THC  OFFICIAL.  NOLICICS  C*TMCH 
CINNCSStO  ON  IMNLICO  Of  T  NC  AOVANCCO  RCSCANCN  HNOjCC'S  ACTNCV  ON  THE  U  S 
COVCNNM1NT 


System  Development  Corporation 

25DD  Colorado  Avenue  «  Santa  Monica,  California  SC4Q6 


System  Development  Corporation 
30  April  1974  it  TM-5243/001/00 

TABLE  OF  CONTENTS 

Page 

1.  INTRODUCTION  AND  SUMMARY . 1 

1.1  Speech  Understandino  Research  .  1 

1.2  Lexical  Data  Archive . 2 

1.3  Common  Information  Structures  .  2 

2.  SPEECH  UNDERSTANDING  RESEARCH  .  3 

2.1  Introduction  .  3 

2.2  Progress  end  Present  Status . 3 

2.2.1  System  Overview . 4 

2.2.2  The  Discourse-Level  Controller  .  6 

2.2.3  The  Parser . 7 

2.2.4  The  Lexical  Matching  Procedure . 

2.2.5  The  Acoustic-Phonetic  Processor  .  9 

2.2.6  System  Testing . 12 

2.2.7  Related  Research . 14 

2.3  Plans . 14 

2.4  Staff . 17 

2.5  Documentation  and  Publications  .  17 

3.  LEXICAL  DATA  ARCHIVE . 18 

3.1  Introduction . 18 

3.2  Progress  and  Present  Status  .  18 

3.2.1  Words  Analyzed  o»»i  Sources . 20 

3.2.2  Sample  Semantic  Analyses  .  21 

3.2.3  Explanatory  Notes  on  Descriptive  Constants  .  24 

3.2.4  Conceptual  Analyses  of  the  Notions  'Cause', 

•Intentional  Action',  and  'Responsible'  .  26 

3.7.5  Other  Activities  .  32 

3.3  Plans . 33 

3.4  Staff . 33 

3.5  Documentation  and  Publications . 33 


20  April  1974 


ii 


System  Development  Corporation 
TM-524 3/001/00 


TABLE  OF  CONTENTS  (Cont'd) 

Page 

4.  COMMON  INFORMATION  STRUCTURES  .  34 

4.1  Introduction . 34 

4.2  Progress  and  Present  Status  .  36 

4.2.1  Common  Data  Description  Language  (CDCL)  Specification  .  37 

4.2.2  Types  of  Data  Conversion  Functions . 38 

4.3  Plans . 39 

4.4  Staff . 40 

LIST  OF  FIGURES 

Figure  Page 

2-1  Overview  of  VDMS .  5 

2-2  The  Parser .  8 

2-3  Lexical  Matching  Procedure  .  10 

2-4  A-Matrix  Processing  .  13 

4-1  Data  Flow  in  the  Data  Conversion  Process . 35 


30  J.oril  1974 


1 


System  Development  Corporation 
TM -5243/001/00 


1.  INTRODUCTION  AND  SUMMARY 


This  document  is  an  interim  report  to  the  Advanced  Research  Projects  Agency 
(ARPA)  for  the  period  1  October  1973  through  31  March  1974  on  System  Develop¬ 
ment  Corporation's  (SDC's)  research  and  development  program  In  Interactive 
Systems  Research  (ISR) .  As  of  the  beginning  of  this  period,  the  program 
includes  three  projects:  (1)  Speech  Understanding  Research,  (2)  Lexical 
Data  Archive,  and  (3)  Common  Information  Structures.  The  primary  emphasis 
is  on  Speech  Understanding  Research  and  related  problems,  though  the  general 
intent  of  the  overall  program  is  to  develop  basin  technology  for  improved 
man-machine  interactive  systems  with  application  to  a  variety  of  anticipated 
military  needs. 

The  Speech  Under  landing  Research  p-oject  contains  many  developments  that  will 
be  material  in  enhancing  and  improving  interactive  systems  by  making  them  more 
capablu  and  productive  when  usad  by  the  casual  user.  Of  particular  importance 
is  the  continuing  effort  to  permit  such  a  user  to  easily  and  effectively 
communicate  in  language  forms  that  are  natural  to  him.  In  support  of  this 
and  other  work,  a  project  to  create  .*  central  archive  of  lexical  information 
for  all  contractors  working  in  Speech  Understanding  or  other  language-based 
iystoms  has  been  started. 

The  world  of  information  processing  has  been  a  continuously  evolving  one  since 
its  creation .  Of  late,  the  fact,  that  we  are  becoming  ever  more  dependent 
on  data  bases  has  become  obvious  (sometimes  painfully  so) .  The  necessity 
for  the  user  of  an  information  system  to  be  able  to  move  his  data  base  is  as 
important  as  a  smooth,  well  engineered  interface.  Toward  .is  end,  the  work 
on  minimizing  the  effort  and  cost  of  moving  a  data  base  from  one  system  to 
another  is  continuing. 

The  following  paragraphs  summarize  the  project  activities  for  the  past  half 
year. 

1.1  SPEECH  UNDERSTAND INC  RESEARCH 

The  Speech  Understanding  Research  (SUR)  project  continues  to  progress  toward 
its  goal  of  creating  a  demonstrable  prototype  Voice-controlled  Data  Manage¬ 
ment  System  (VDMS)  using  free -form  spoken  English  as  input.  The  two  parallel 
development  efforts  of  last  year  in  acoustic -phonetic  processing  and  linguistic 
processing  were  merged  into  a  single  system  that  was  demonstrated  to  the  ARPA 
SUR  Group  Review  Team  in  December  1973.  Work  has  continued  on  all  components 
of  the  system,  with  the  objective  of  both  increasing  the  vocabulary  and 
loosening  the  grammar  and  syntax  of  the  language  accepted  by  the  system.  The 
improved  version  of  VDMS  should  be  demonstrable  in  the  fall  of  1974. 
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1.2  LEXICAL  DATA  ARCHIVE 

The  Lexical  Data  Archive  project  began  this  past  October  witji  the  objective 
of  providing  a  centrally  collected  body  of  lexical  information  on  the  union  of 
lexicons  used  by  the  ARPA  SUR  Contractors  in  their  collective  and  individual 
endeavors.  Toward  this  end,  the  Semantically  Oriented  Lexical  Archive  (SOLAR) 
has  been  designed,  and  much  of  the  relevant  data  has  been  either  located  or 
acquired.  Several  SOLAR  files  already  exist,  and  others  are  being  created. 
Information  from  these  existing  files  is  available  tc  its  using  community 
jr  ally  at  present.  Several  examples  of  files  contained  in  SOLAR  jure 
presented.  Within  the  next  six  months,  more  expensive  information  will  be 
available,  most  of  it  automatically. 

1.3  COMMON  INFORMATION  STRUCTURES 

Work  is  continuing  on  creating  the  methodology  and  necessary  components  of  a 
system  for  moving  data  bases  from  one  system  to  another  at  minimal  effort  and 
cost.  Current  conventional  methods  have  been  studied,  and  a  unique  approach 
has  been  devised  that  makes  maximal  use  of  the  Data  Management  System  tools  by 
which  data  bases  are  created,  updated,  and  queried.  Three  descriptive 
languages  needed  to  describe  the  transfer  process  have  been  defined.  They 
are  a  Common  Data  Description  Language  (CDDl) ,  a  Common  Data  Translation 
Language  (CDTL) ,  and  a  Common  Data  Format  Language  (CDFL) .  A  first  version 
of  the  CDDL  is  presented.  Work  will  continue  on  the  creation  of  the  other 
tvo  languages,  arid,  eventually,  the  method  will  be  implemented  and  tested. 
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2.  SPEECH  UNDERSTANDING  RESEARCH 

2.1  INTRODUCTION 

The  continuing  long-term  goal  of  the  SDC  Speech  Understanding  Research  (SUR) 
project  is  to  develop  and  implement  a  data  management  system  that  is  controlled 
and  operated  by  its  users  through  free-form  spoken  English.  The  basic  approach 
taken  to  achieve  this  goal  is  distinguished  by  a  modular  system  architecture 
that  embodies  phonological  and  linguistic  processes  and  an  acoustic-phonetic 
processor.  The  system  architecture  enables  a  complete  assembly  of  multidirectional 
parsing  processes  to  operas  in  parallel  on  the  same  or  different  segments  of  an 
input  utterance.  Two  major  advantages  are  obtained  from  this:  (1)  the  system  may 
start  working  on  the  least  ambiguous  portions  of  the  input,  and  (2)  predictions 
need  not  be  limited  to  near  neighbors  of  a  recognized  input  segment  but  may  be 
applied  to  any  portion  of  the  entire  utterance. 

The  acoustic-phonetic  processor  contains  the  processes  that  extract  acoustic 
information  from  the  speech  signal  and  make  acoustic-phonetic  labeling  decisions. 
The  processor  we  are  developing  reflects  the  fact  that  the  speech  signal  is 
never  wholly  unambiguous;  any  attempt  to  precisely  label  phones  and  their 
boundaries  murt  recognize  and  allow  for  this  ambiguity  in  mapping  the  extremely 
large  number  of  speech  sounds  into  the  relativelv  small  set  of  acoustic-phonetic 
transcription  symbols.  Accordingly,  in  this  processor,  each  acoustic -phonetic 
segment  is  multiply  laoeled,  and  each  label  is  assigned  a  score.  Scores  are 
based  on  a  measure  function  that  is,  in  turn,  based  or.  feature  parameters 
previously  developed  for  each  speaker. 

As  a  first  step  toward  the  long-term  goal,  we  decided  to  construct  and  refine 
a  limited  voice-controlled  data  management  system  (VDMS)  that  could  accept 
continuous  speech  ar.d  be  demonstrably  usable  by  at  least  two  speakers.  Within 
this  system,  limitations  were  planned  with  respect,  to  both  the  vocabulary  and 
the  syntax  cf  the  English  subset  permitted. 

2.2  PROGRESS  AND  PRESENT  STATUS 

At  the  beginning  of  the  present  contract  year,  two  separate  versions  of  VDMS 
had  been  constructed  and  tested:  Version  A,  which  operated  on  the  SUR  lab¬ 
oratory  Raytheon  704  mini-computer  in  conjunction  with  an  IBM  370/145,  and 
which  incorporated  the  modular  system  architecture,  ari  Version  B,  which 
operated  entirely  on  the  Raytheon  704,  and  which  embodied  the  multiple¬ 
labeling  philosophy  of  the  acoustic-phonetic  processor  described  above.  Both 
versions  allowed  t'ne  user  to  access  a  data  base  of  information  about  the  length 
beam,  draft,  armament,  and  other  characteristics  of  submarines  in  the  naval 
fleets  of  the  United  States,  Soviet  Union,  and  United  Kingdom.  The  total 
vocabulary  of  each  system  was  approximately  150  words.  The  query  language 
used  to  access  this  information  could  be  described  by  about  35  syntax 
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equations  for  Version  A  and  about  30  syntax  equations  for  Version  B;  the 
difference  is  accounted  for  by  the  fact  that  Version  A  contained  report 
generation  capabilities  that  Version  B  did  not.  Typical  queries  that  could 
be  accommodated  by  either  system  are: 

"Total  quantity  where  type  equals  nuclear  and  country  equals  USA." 

"Print  type  where  missiles  greater  than  seven." 

Both  of  these  initial  versions  of  VDMS  had  been  tested  with  a  large  number  of 
utterances  and  had  achieved  reasonably  good  results.  The  first  major  task 
undertaken  during  this  contract  year  was  the  construction  of  a  single  full- 
scale  version  of  VDMS  that  combines  the  best  elements  of  the  two  versions.  This 
new  version  of  VDMS  (Ritea,  [1])  was  completed  and  successfully  demonstrated  in 
late  1973.  Its  major  characteristics  are  described  in  -this  section. 

2.2.1  System  Overview 

The  overall  configuration  of  VDMS  is  characterized  by  three  major  processing 
modules : 

1)  The  linguistic  processor,  which  contain**  the  parser  and  a  discourse- 
level  controller? 

2)  The  acoustic-phonetic  processor,  whose  results  are  contained  in  an 
array  of  dat?.  called  the  A-matrix? 

3)  The  lexical  matching  procedure,  which  performs  matches  of  predicted 
words  at  the  syllable  level,  using  various  applications  of  phono¬ 
logical  rules  to  assist  in  its  matchings. 

The  pattern  of  communication  among  these  modules  is  illustrated  in  Figure  2-1. 
The  speech  from  the  user  is  input  to  the  acoustic-phonetic  processor,  which 
forms  an  array  of  acoustic-phonetic  data  for  use  by  the  parser.  At  the  begin¬ 
ning  of  the  processing  of  an  utterance,  the  discourse-level  controller  provides 
a  variety  of  predictions  and  restrictions  on  what  is  allowable  or  expected  in 
this  utterance.  The  predicted  words  are  transmitted  to  the  lexical  matching 
procedure,  which  looks  for  the  words  in  the  acoustic-phonetic  data.  The  parser 
and  lexical  matching  procedure  then  pass  predictions  and  verifications  back  and 
forth  to  one  another  in  an  effort  to  understand  the  utterance.  Once  it  is 
understood,  the  utterance  is  passed  to  the  data  management  system,  which  forms 
an  appropriate  response.  The  response  is  then  passed  to  the  discourse-level 
controller  and  to  the  user.  The  discourse-level  controller?  is  thus  updated 
to  aid  in  future  predictions . 

The  logical  flow  of  control  and  data  between  all  of  the  modules  is  specified  by 
a  language  unique  to  VDMS,  nailed  the  Control  Structure  Language  (CSL) .  Using 
CSL  for  program  control,  new  moduies  may  be  implemented,  and  data  paths  among 
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Figure  2-1.  Overview  of  VDMS 
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modules  in  VDMS  may  be  modified  without  major  reprogramming  of  the  systtm.  In 
addition,  CSL  has  the  following  features: 

1)  It  allows  for  logical  parallel  execution  of  modules. 

2)  It  provides  for  the  running  of  modules  on  remote  computers. 

3)  Using  a  trace  and  debugging  provision,  breakpoints  can  be  inserted 
for  monitoring  the  flow  of  data  through  the  system. 

4)  Charges  m  the  order  of  execution  of  the  various  module?*  may  be 
specified. 

5)  Data  dependencies  among  modules  may  be  controlled. 

A  more  detailed  discussion  of  CSL  is  given  by  Barnett  [2J . 

2.2.2  The  Pis course -Level  Controller 


The  discourse -level  controller  comprises  two  modules:  the  user  model  and  the 
thematic  memory.  The  user  model  determines  what  query  "state"  the  user  is  in 
and  predicts  the  kinds  of  grammar  that  may  appear  in  his  next  interaction  with 
the  system.  Some  sample  "states"  are  system  login,  interactive  query  mode, 
report  generation  mode,  and  user  aids.  If  the  user  is  in  interactive  query 
mode,  the  user  model  will  predict  syntax  equations  for  the  next  interaction, 
such  as  those  for  Print,  Repeat,  Count,  Subset,  or  Total,  each  of  which  is  the 
first  word  of  an  interactive  query  statement.  The  words  Explain  and  Describe 
are  the  first  words  of  typical  ucer-aid  commands.  Each  prodiction  carries 
with  it  a  confidence  level  such  that  the  higher  the  cenfidonce  level,  the "more 
liberal  the  system  will  be  in  overlooking  errors  in  recognition. 

T)  thematic  memory  is  concerned  with  particular  content  words  that  might  occur 
in  the  next  utterance;  it  is  not  concerned  with  syntactic  terminals  such  as  the 
digits  or  the  word  "Print,"  Several  pieces  of  information  are  kept  about  each 
word  as  it  is  used:  e.g.  ,  how  long  (how  many  utterances  ago)  it  has  been  since 
the  word  was  used  and  how  likely  it  is  that  the  word  will  re-occur,  depending 
on  how  it  was  used  originally.  For  example,  if  the  user  said  "Show  category," 
the  assumption  is  that  the  next  conmand  will  probably  involve  something  about 
the  categories  of  submarines  in  the  data  base. 

In  addition  to  looking  for  content  words  in  user  commands,  the  thematic  memory 
also  keeps  a  record  of  any  non-numeric  symbolic  responses  from  the  data  manage¬ 
ment  system.  These  responses  are  also  used  to  predict  words  that  are  highly 
likely  to  occur  in  the  next  utterance. 

throughout  a  dialogue,  the  various  content  words  as  predicted  by  the  thematic 
memory  are  "aged"  from  utterance  to  utterance,  and  their  likelihood  of  being 
used  is  diminished  if  they  have  not  been  reused.  If  the  "Confidence  of 
Prediction"  drops  below  a  threshold,  then  the  word  i3  removed  from  the  thematic 
memory  and  dropped  from  consideration  until  it  is  used  again.  Also,  if  duw 
plicate  entries  occur  within  an  utterance,  then  the  age  and  original  merit  are 
modified  to  take  care  of  this  effect. 
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2.2.3  The  Parser 


The  basic  linguistic  unit  used  'or  the  parsing  strategy  in  VDMS  is  tho  phrase, 
which  consists  of  one  or  more  vocabulary  words  (up  to  the  complete  utterance) 
linked  together  in  a  syntactically  and  semantically  correct  order.  Some 
examples  of  phrases  are  "country  and  category"  and  "quantity  equals  five." 

The  parser  attempts  to  predict  pnrases  using  the  user-  odel,  thematic-patterning, 
and  grammatical  and  semantic  constraint  s  information  provided  by  the  discourse- 
level  controller  Predicted  phrases  are  matched  against  the  acoustic-phonetic 
data  for  ac-'eptance  or  rejection.  Accepted  phrases  are  then  concatenated  to 
form  a  larger  phrase,  which  is  'hen  analyzed  to  see  wi  cher  it  it  a  complete 
utterance. 

The  parser  consists  of  four  major  modules: 

1)  The  classifier 

2)  The  bottom  driver 
' )  T*  top  driver 

4)  The  side  driver 

The  classifier's  task  is  to  assign  a  syntactic  category  each  word  accepted  by 
the  lexical  matching  procedure.  Some  typical  syntactic  categories  and  examples 
are: 


Item  name  {"country") 

IteJi  value  ("US?") 

Syntactic  terminal  ("print") . 

A  syntactic  category',  as  generated  by  the  classifier,  is  used  by  rha  other 
modules  of  the  parser  to  generate  pr ‘dictions  about  allowable  syntax  in  other 
parts  of  the  utterance.  The  bottom  driver  is  a  typical  "bottom-up"  module, 
whicn  takes  found  phrases  and  determines  how  they  may  be  used  in  completing 
the  parsing  of  a  complete  utterance.  The  top  driver  takes  predicted  phrorcs 
and  from  them  derives  either  a  syntactic  terminal  or  a  shorter  phrase  to  be 
looked  for  next.  The  syntactic  terminals  arc  sent  tc  the  lexical  matching 
procedure,  which  then  attempts  to  match  each  one  against  the  acoustic-phonetic 
data.  The  side  driver  takes  completed  or  partially  completed  phrases  from  the 
bottom  driver.  If  a  phrase  is  incomplete,  the  side  driver  determines  which 
part  to  look  for  next,  and  it  will  ask  the  top  driver  tc  locate  the  missing 
part.  On  the  other  hand,  if  the  phrase  has  been  ;d,  the  side  driver 

analyzes  it  t*.  see  if  it  is  a  legal  complete  utterance.  If  it  is,  the  side 
driver  terminates  the  parsing  activities  of  all  modules  and  transmits  the 
syr'  olic  form  of  the  hypothesized  utterance  to  the  data  management  system 
enc  the  discourse-level  controller.  Other  completed  phrases  (which  do  not 
cc  »r  the  ertire  ut  'ranee)  are  used  to  "bectom-drive"  the  entire  system  or 
"bottom-drive"  tha  artial  prrse  up  one  Invei  to  create  larger,  mere  cc  'plete 
phrases.  The  flew  o  pr  ssing  within  the  parser  is  shown  in  Figure  2-*. 
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Figure  2-2,  The  Parser 
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2.2.4  The  Lexical  Matching  Procedure 

The  lexical  matching  procedure  verifies  or  rejects  a  predicted  word  through 
pattern-matching  against  the  available  acoustic-phonetic  data.  A  detailed 
description  of  the  procedure  is  given  by  Weeks  [3] . 

The  syllable  is  the  unit  that  is  used  in  the  lexical  matching  process.  The 
linguistic  issues  concerning  its  existence  or  form  have  been  sidestepped  by 
giving  it  an  algorithmic  definition.  All  words  have  syllable  divisions 
marked  in  the  lexicon,  and  seme  of  the  phonemic  rules  cure  written  in  terms 
of  these  boundaries.  Within  a  syllable,  most  of  ■che  co-articulation  is 
internal;  effects  over  boundaries  are  handled  separately.  Because  a  good 
percentage  of  phonetic  dependencies  occur  within  these  units,  rules  can  be 
conveniently  applied.  Under  this  approach,  a  separate  set  o :  rules  must  be 
set  up  for  dealing  with  interactions  over  boundaries.  Word  boundaries  can 
then  be  considered  as  special  cases  of  syllable  boundaries. 

Figure  2-3  is  a  b! >ck  diagram,  of  the  lexical  matching  procedure.  When  a  word 
is  predicted  in  orthographic  form,  its  pho  emic  representation  is  extracted 
from  the  lexicon.  A  set  of  phonemic  rules  is  applied  to  the  phonemic  rep¬ 
resentation  to  obtain  a  set  of  lexical  variants.  These  variants  arise  by 
phonemic  replacements  as  dictated  by  the  rules.  The  set  of  lexical  variants 
is  then  sent  to  the  main  matching  procedure,  where  each  is  matched  one  by  one 
against  the  acoustic-phonetic  data  on  a  syllable-by-syllable  basis.  The 
boundary  analysis  is  done  ir.  conjunction  with  the  syllable  matching  and  attempts 
to  compensate  for  articulation  across  syllable  and  word  boundaries .  The 
resulting  scores  for  each  lexical  variant  are  then  sent  to  the  main  matching 
procedure,  which  decides  which  possibility  gives  the  best  over-all  score.  The 
structure  of  the  phonemic  rules  pass  is  described  by  Barnett  [4]. 

2.2.5  The  Acoustic-Phonetic  Processor 


Speech  is  input  interactively  in  a  very  quiet  sound  booth  (signal-to-noise 
r^tio  better  than  50  dB)  us^ng  a  Sony  ECM-377  condenser  microphone,  whit* 
has  an  essentially  flat  frequency  response  to  beyond  10,000  Hz.  Low-level 
preemphasis  is  employed,  shaping  the  frequency  response  with  a  zero  at 
300  Hz  and  a  pole  at  3,000  Hz.  This  speech  signal  is  bandlimited  to  9,000  Hz 
and  digitized  at.  20,000  samples  per  second  using  a  12-bit  analog -to- digital 
converter.  The  input  speech  is  saved  directly  on  digital  media  with  no 
intervening  analog  recording  steps. 


The  digitized  speech  is  then  passed  through  a  digital- to- analog  converter, 
and  the  resulting  analog  waveform  is  passed  through  three  hardware  filters 
having  nominal  bandpasses  of  150  to  900  Hz,  900  to  2,200  Hz,  and  2,200  to 
5,000  Hz,  respectively.  Over  10-msec,  intervals,  two  parameters  are  extracted 
from  each  of  the  three  filter  outputs:  the  maximum  peak-to-peak  amplitude  and 
a  count  of  zero  crossings.  The  resulting  six  parameters  (two  fretn  each  of  the 
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Figure  2-3.  Lexical  Matching  Procedure 
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three  filtered  signals)  are  used  to  assign  a  rough  acoustic  label  to  each 
10-msec,  segment.  Five  labels  are  currently  used:  VW  (vowel-like) , 

SS  (strong  frication) ,  SI  (silence),  UV  (low-amplitude  voiced  or  unvoiced), 

VC  (all  other- -usually  weak  voicing) .  The  next  step  in  the  processing  is 
to  refins  these  rough  labels,  i.e.,  impose  on  each  10-msec,  segment  a  more 
accurate  label  than  the  five  above. 

Within  the  classes  VW  and  VC,  more  specific  labels  are  assigned  using  a  vowel- 
reccgnition  strategy  ba^ed  on  speaker-dependent  vowel  formant  information. 

This  information  is  compared  with  the  formants  (obtained  with  the  use  of  a 
Linear  Predictive  Coeff4.cient  (LPC)  spectrum)  at  selected  instants  in  time  for 
each  vowel  to  be  identified.  A  modified  Euclidean  distance  function  is  used 
to  compute  the  relative  distances  between  the  candidate  formant  values  and  the 
p re-stored  speaker-dependent  vowel  formant  values.  The  closest  three  vowels 
are  selected,  and  associated  scores  are  assigned  to  these  choices  based  on  the 
values  of  the  distance  function. 

Fricatives  and  plosives  are  characteristically  found  within  sequences  of 
segments  labeled  SS,  VC,  or  UV.  For  these  areas,  a  technique  called  the  Low- 
Coefficient  LPC  (LCLPC) ,  described  by  Molho  [5] ,  has  been  shown  to  provide 
meaningful  spectra  that  correspond  well  with  both  acoustic-phonetic  theory 
and  with  the  experimental  results  of  others.  Time  resolution  is  sufficiently 
narrow  to  allow  independent  spectral  analysis  of  the  release,  frication,  and 
aspiration  portions  of  an  unvoiced  plosive  or  to  demonstrate  spectral  change 
within  a  consonant  cluster,  so  that  clusters  such  as  As/  and  /ts/  may  often 
be  distinguished.  For  analysis  of  unvoiced  speech,  the  LCLPC  uses  the  auto¬ 
correlation  method  with  eigut  coefficients  and  a  6-msec.  Hamming  window. 
A’^lysis  of  spectra  obtained  in  this  way  allows  the  following  five  classes 
co  be  distinguished: 

1)  labial  or  dental  (LD) 

2)  alveolar  (AL) 

3)  alveopalatal  (AP) 

4)  palatal  or  velar  (PV) 

5)  voiced  or  low  energy  (VS) 

These  classes  correspond  roughly  to  the  spectral  characteristics  of  unvoiced 
fricatives  and  plosives.  Moreover,  there  is  a  correspondence  between  these 
classes  and  tha  articulatory  positions  of  unvoiced  fricatives  and  plosives. 

The  classes  LD,  AL,  AP,  and  PV  ideally  contain  the  following  phonemes: 

LD:  /P/,  /f/,  A/ 

AL:  /t/,  /s/ 

AP:  /// 

PV :  A/ 
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Experimentation  has  also  confirmed  that  the  glide  /w/  and  the  liquid  /l/ 
characteristically  occur  within  the  VW  or  VC  classes.  The  present  approach 
to  recognizing  these  phonemes  is  to  augment  a  speaker's  vowel  formant  table 
with  the  formant  frequency  values  for  /w/  and  /l/.  These  formant  values 
have  consistently  been  easily  distinguishable  from  the  forma  its  of  the  vowels 
and  have  enabled  the  system  to  accurately  isolate  and  recognize  /w/  and  /l/. 

The  glide  /y/  and  the  liquid  /r/  are  handled  indirectly,  again  with  the  use 
of  the  speaker-dependent  vowel  formant  table:  if  a  lC-msec .  segment  has  been 
labeled  /i/,  it  is  assumed  that  the  segment  could  be  a  /y/  with  equal  prob¬ 
ability,  and  both  labels  are  then  assigned  to  the  segment  with  the  same  score. 
If  a  segment  has  been  labeled  /£/  (again  with  the  aid  of  the  vowel  table) ,  the 
label  /r/  is  assigned  to  the  same  segment  with  an  equal  score. 

Although  the  system  is  not  yet  able  to  distinguish  the  various  elements  within 
tiic  class  of  nasals,  viz.,  /m/,  /n/,  ,  i/»  /m/,  /n/,  a  single  class  name  (NA) 
is  used  and  has  proved  quite  reliable .  A  segment  is  labeled  NA  based  upon  some 
simple  tests  involving  the  amplitudes  and  bandwidths  of  F..  ,  F^,  and  F^.  All 
of  the  aforementioned  segment  labeling  procedures  are  used  to  construct  an 
array  of  acoustic-phonetic  data  called  the  A-matrix.  The  construction  of  the 
A-mafcrix  is  shown  in  Figure  2-4.  Each  row  of  the  A-matrix  corresponds  to  a 
10-msec,  segment  of  speech  and  contains  a  rough  segment  label  (VW,  SS,  SI,  VC, 
or  UV) ,  one  or  more  refined  segment  labels  and  associated  scores  based  on  the 
above  procedures,  formant  frequency  values,  and  estimates  of  fundamental 
frequency,  RMS  energy,  and  other  acoustic-phonetic  parameters  used  in  the 
assignment  of  the  phoneme  ar.d  phoneme-class  labels . 

2.2.6  System  Testing 

In  the  present  configuration,  speech  is  digitally  recorded  and  saved  on  disk, 
as  described  above,  usinc  a  Raytheon  704  computer.  The  same  computer  then 
creates  an  A-matrix  from  he  digitized  waveform.  The  A-matrix  is  then  sent 
(via  a  direct  hardware  link)  to  an  IBM  370/145  computer,  which  then  performs 
all  subsequent  lingustic  processing  of  the  utterance  and  returns  a  response 
to  the  user.  Since  the  system  is  dependent  upon  thematic  patterning  to  aid 
in  the  understanding  of  an  utterance,  it  is  necessary  for  the  user  to  inter¬ 
act  with  VDMS  using  goal-directed  dialogues.  For  purposes  of  testing,  ten 
dialogues  were  created  with  an  average  of  ten  utter ances  per  dialogue. 

Each  of  two  male  speakers  (for  when  vowel  formant  tables  had  been  praviously 
constructed)  recited  the  sets  of  dialogues.  In  this  initial  test  of  TOMS,  an 
average  of  52%  of  the  utterances  were  correctly  understood.  Analysis  of  these 
preliminary  results  has  shown  that  this  figure  can  be  increased  by  correcting 
program  errors  and  implementing  some  modifications  to  the  phonological 
processes  and  lexical  matching  procedure. 
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Figure  2-4.  A-Matrix  Processing 
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2.2.7  Related  Research 

An  experiment  was  designed  to  compare  the  formant  1  (Fl)  and  formant  2  fF2) 
frequency  movements  of  vowels  next  to  /r/  with  the  same  vowels  before  other 
consonants  [6].  Lehiste's  data^  (obtained  from  spectrograms )  on  the  vowel 
allophones  associated  with  /r/  were  used  for  comparison  purposes.  The  data 
for  this  experiment  were  based  on  formant  trajectories  computed  by  LPC 
techniques  on  the  Raytheon  704  computer. 

The  results  of  this  experiment  oonfirmed  Lehiste's  work,  which  indicates 
that  there  is  a  change  in  seme  vowels  in  a  retroflexed  environment.  The 
change  in  vowels  after  /r/  is  minimal  except  for  /i/,  but  the  change  in 
vowels  before  /r/  is  considerable . 

This  was  a  preliminary  experiment  in  #hich  the  number  of  subjects  and  samples 
was  smali .  However,  the  results  can  be  used  to  develop  a  retroflexed  vowel 
space  on  the  basis  of  a  non-retroflexed  vowel  space  and  to  compare  the  identi¬ 
fication  of  vowels  using  this  new  F1-F2  space  with  the  identification  of  vowels 
using  the  non-retroflexed  F1-F2  space. 

An  algorithm  was  designed  *  t  automatically  distinguishes  the  nasals  /n/, 

/m/,  and  /n/  from  each  other  [7] .  Spectral  analysis  is  performed  on  the  Raytheon 
computer  using  an  LPC  model  to  locate  the  formants  of  these  phonemes.  By 
comparing  the  formar.t  frequencies  of  unknown  nasals  to  prototype  values 
derived  from  normalization  utterances,  the  algorithm  was  able  to  correctly 
identify  nasals  in  72%  of  the  cases  tested.  Experimentation  has  indicated 
that  (1)  automatic  techniques  can  be  employed  to  distinguish  nasals  in  con¬ 
tinuous  speech;  (2)  linear  prediction  can  be  used  to  effectively  analyze  the 
spectra  of  these  phonemes;  and  (3)  speaker- dependent  tables  of  prototype  nasal 
formants  extend  these  results  to  multiple-speaker  environments. 

2.3  PLANS 

The  original  goal  to  be  reached  by  the  end  of  the  current  contract  year  (as 
specified  in  SDC  Proposal  73-5674)  consisted  of  enlarging  the  vocabulary  of 
the  query  language  and  loosening  the  grammar  to  make  the  language  easier  and 
more  natural  to  use.  Specifically,  VDMS  was  to  contain  a  vocabulary  of  about 
500-600  words,  and  the  grammar  was  to  be  modified  to  admit  the  following 
capabilities : 

1)  More  natural  form  of  expression  for  integers  (example:  "thirty- 
four"  instead  of  "three  four"). 

2)  Facility  for  inter-item  comparisons  (example:  "Print  category 
where  surface  speed  greater  than  submerged  speed"). 

3)  Use  of  strings  of  inequalities  (example:  "Print  type  where  draft 
greater  than  seven  and  less  than  nine"). 


I.  Lehiste,  Acoustical  Characteristics  of  Selected  English  Consonants, 
Mouton  &  Company,  The  Hague,  1964. 
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4)  Simple  arithmetic  calculations  (example:  "Print  category  where 
surface  speed  greater  than  three  times  submerged  speed"). 

5)  Interrogative  sentences. 

However,  recent  discussions  and  negotiations  (at  the  request  of  ARPA)  with  the 
Stanford  Research  Institute  (SRI)  have  yielded  a  cooperative  operating  plan  in 
which  this  original  goal  has  been  changed.  We  believe  that  our  proposed 
modifications  for  September  1974  will  provide  a  stronger  base  from  which  to 
proceed  to  our  long  range  goal,  viz.,  the  five-year  system  as  characterized  by 
Newell,  et  al.1  The  modifications  are  summarized  below  and  described  in 
SDC's  current  proposal  to  ARPA  (SDC  Proposal  74-5490,  15  April  1974). 

Two  major  systems  jure  planned  for  the  end  of  the  current  contract  year 
(September  1974) : 

1)  Version  A,  which  will  have  a  vocabulary  of  about  150  words,  an 
English-like  grar/na^  (with  the  aid  of  the  SRI  parser) ,  and  all 
other  features  similar  to  the  present  VDMS. 

2)  Version  B,  which  will  be  the  present  VDMS  with  a  vocabulary  of 
about  300  words. 

Version  A  will  provide  the  first  step  toward  a  natural  English  VDMS.  The  main 
use  of  Version  B  will  be  :o  test  the  robustness  of  the  acoustic-phonetic 
algorithms  on  a  larger  vocabulary. 

The  following  tasks  are  currently  in  process  to  aid  in  the  construction  of 
Version  A: 

1)  System  Architecture — The  design  concepts  underlying  the  system 
developed  by  SDC  and  SRI  during  the  .irst  two  years  of  the  ARPA 
program  were  sufficiently  similar  that  it  seemed  reasonable  to 
develop  a  single  system  that  would  combine  the  procedure  for 
acoustic,  phonetic,  and  phonological  analysis  developed  by  SDC 
with  the  procedures  for  syntactic,  semantic,  and  pragmatic 
analysis  developed  by  SRI.  To  accomplish  this  goal,  we  are 
re-analyzing  the  design  concepts  of  each  and  developing  a  new 
system  design.  (Responsibility:  SDC,  SRI) 

2)  Parsing — The  design  of  the  SRI  parser  is  being  revised  to 
accommodate  the  SDC  procedures  for  acoustic-phonetic  analysis 
and  lexical  mapping.  Changes  also  may  result  from  additions 
to  the  grammar.  An  initial  version  of  the  revised  parser  will 
be  operating  in  the  new  system  by  September.  (Responsibility: 

SRI) 


^Speech-Understanding  Systems:  Final  Report  of  a  Study  Group. 
Pittsburgh,  Carnegie -Me lion  University,  1971. 
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3)  Grammar — A  major  reorganization  of  the  SRI  grammar  is  being  made. 

The  rules  included  in  the  initial  revision  will  be  influenced 

by  dialogues  collected  during  the  early  protocol  experiments 
(see  below) .  The  structure  of  the  grammar  is  being  changed  so 
that  additions  and  modifications  can  be  made  more  easily. 
(Responsibility:  SRI) 

4)  System  Software — Additions  to  the  system  software,  particularly 
extensions  and  modifications  of  SDC  LISP,  will  be  made  as 
required.  (Responsibility:  SDC) 

5)  System  Hardware — Work  is  proceeding  on  the  ARPANET  interface  for 
the  SDC  computer  facility.  As  the  special-purpose  signal  processing 
and  acoustic  analysis  oomputers  are  acquired,  they  will  be  interfaced 
into  the  system.  (Responsibility:  SDC) 

6)  Lexical  Matching  Procedure — Substantial  changes  will  be  made  for 
coordination  with  the  SRI  parser  and  grammar  for  Version  A. 
(Responsibility:  SDC) 

7)  Protocol  Experiments — Persons  are  being  located  who  are  thoroughly 
familiar  with  information  retrieval  operations  on  data  of  the 
type  contained  in  the  submarine  task  domain.  On  the  basis  of 
interviews  with  them,  a  set  of  tasks  will  be  identified  and  used 
in  eliciting  task-oriented  dialogues.  Early  result®  will  bs  used 
immediately  to  determine  additions  to  the  vocabulary  and  tc  the 
data  base.  Subsequent  analyses  will  /uide  revisions  to  the  grammar 
and  the  design  of  an  effective  discourse  modal.  (Responsibility: 

SRI) 

8)  Semantics-  -Major  changes  in  the  semantics  developed  for  the  original 
SRI  task  domain  will  be  made,  initially  to  accommodate  the  data 
management  task.  Subsequently,  as  the  data  base  is  extended  in 
accord  with  the  findings  from  the  protocol  experiments ,  additional 
modifications  will  be  made.  (Responsibility:  SRI) 

The  following  tasks  are  currently  in  process  to  aid  in  the  construction  of 
Version  B: 

1)  Acoustic-Phonetic  Analysis — The  algorithms  used  to  build  the 
A-matrix  for  an  utterance  are  being  refined  and  extended,  and 
new  procedures  are  being  added  to  improve  the  accuracy  of  the 
classifications.  (Responsibility:  SDC) 

2)  Lexical  Matching  Procedure — The  current  lexical  matching  procedure 
in  VDMS ,  with  minor  modifications,  will  be  adequate  for  testing 
the  system  with  the  extended  vocabulary.  (Responsibility:  SDC) 
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3.  LEXICAL  DATA  ARCHIVE 

3.1  INTRODUCTION 

The  Lexical  Data  Archive  (LDA)  project  has  addressed  itself  to  the  task  of 
providing  the  ARPA  SUR  project  contractors  with  semantic  and  syntactic  data 
for  the  words  in  their  lexicons.  Being  devoted  exclusively  to  lexical  research , 
LDA  can  assure  a  broad  range  of  services  for  each  SUR  project  without  the  trip¬ 
lication  of  effort  and  resources  which  would  be  required  if  the  three  SUR 
projects  were  to  perform  the  same  tasks.  LDA  will  monitor  a  broad  range  of 
lexical  data  sources,  select  the  data  having  potential  payoff  for  speech  under¬ 
standing,  format  that  data  for  archiving  purposes,  and  provide  for  its  dissem¬ 
ination  to  the  appropriate  SUR  projects . 

3.2  PROGRESS  AND  PRESENT  STATUS 

Since  the  initiation  of  LDA  in  September  1973,  the  following  achievements  have 
been  realized.  First,  the  design  of  the  Semantically-Oriented  Lexical  Archive 
(SOLAR)  has  been  completed.  This  task  included  deciding  what  types  of  lexical 
data  to  collect,  determining  the  data  collection  procedures,  writing  the 
specifications  of  the  programs  needed  to  extract  data  from  machine  transcripts, 
and  designing  the  logical  structure  of  the  files  to  be  built.  In  accordance 
with  questionnaire  responses  from  the  SUR  projects,  SOLAR  will  consist  of  eight 
files . 

1)  A  word  index  will  allow  a  user  to  easily  determine  the  types  of  data 
available  for  a  given  word. 

2)  A  bibliographic  reference  file  can  be  used  in  conjunction  with  other 
files,  to  allow  abbreviated  refex'ences  and  can  also  be  used  as  a 
separate  resource  for  guidance  into  the  literature. 

3)  A  file  of  semantic  analyses  will  contai  ;i  formal  treatments  of  the 
semantic  properties  of  individual  words  as  found  in  the  literature. 
Notes  will  also  be  provided  explaining  the  descriptive  constants 
employed  in  these  analyses. 

4)  Integrated  summaries  of  analyses  given  in  the  philosophical  literature 
for  concepts  central  to  the  descriptive  constants  used  in  the  semantic 
analyses  will  be  constructed. 

5)  Collocational  information  found  in  the  definitions  of  Webster's  Seventh 
New  Collegiate  Dictionary  (W7)  is  to  be  machine-extracted  and  made 
accessible  via  the  words  to  which  it  pertains. 

6)  Definitional  expansions  controlled  by  the  words  within  a  particular 
SUR  lexicon  will  be  constructed. 
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7)  Semantic  fields  will  be  built  for  ench  SUR  word  by  tying  to  them 
the  words  found  in  certain  definitional  and  synonym! tive 
relationships  Jn  W7  and  Roget's  International  Thesaurus  (Roget) . 

8)  Finally,  every  context  of  each  SUR  word  in  the  W7  defir.itioas  and 
in  the  Brown  Corpus  will  be  entered  in  a  key-word-in-context  (KWIC) 
file. 

For  a  more  detailed  discussion  of  the  contents  of  each  of  these  files  see  Dillor 
and  Olney  [1] . 

A  second  achievement  has  been  the  collection  of  a  significant  amount  of  data 
for  the  first  three  files  just  described.  All  of  the  SUR  words  in  use  as  of 
about  January  1974  have  been  entered  into  the  word  index,  together  with  their 
W7  parts  of  speech  and  an  indication  of  the  SOLAR  data  available  for  each. 

About  2000  references  to  linguistic  and  philosophical  documents  have  been 
entered  into  the  bibliographic  file.  Indexing  by  author,  title,  and  keyword 
(among  other  parameters)  is  possible  for  each  bibliographic  entry.  Approx¬ 
imately  70  semantic  analyses  have  been  entered  into  the  third  file.  A  list 
of  the  words  to  which  these  analyses  apply  is  given  in  Section  3.2.1.  Section 
3.2.2  presents  four  semantic  analyses  of  'give'  as  they  currently  appear  in 
the  archive.  In  Section  3. 2. 3  notes  on  the  descriptive  constants  used  in  one 
of  those  analyses  are  given.  Without  such  notes,  some  of  the  analyses  would 
be  virtually  incomprehensible  (e.g.,  the  first  one,  which  is  taken  from  Bendix's 
dissertation).  Section  3.2.4  presents  three  conceptual  analyses,  two  of  which 
are  especially  relevant  to  descriptive  constants  used  in  Bendix's  semantic 
analysis  of  'give',  viz.,  'cause (s) '  and  the  sequence  ' (C  AN-F) ' .  Bendix's 
comments  on  these  onstants  (see  our  explanatory  notes  E-113,  E-115,  and  E-118 
in  Section  3.2.3)  include  the  following  reservations:  "Ultimately  we  would 
want  to  define  the  metalinguistic  term  'cause'  more  precisely.  This  would  be 
intended  as  a  definition  not:  of  causation  but  of  what  we  claim  a  speaker  is 
asserting  when  he  uses  a  form  whose  meaning  we  define  as  containing  the  component 
'cause'."  While  we  certainly  grant  that  the  task  of  defining  Bendix's  meta¬ 
linguistic  term  'cause'  is  distinguishable  from  the  task  of  defining  causation, 
we  question  whether  the  former  task  can  be  accomplished  successfully  in  total 
isolation  from  the  latter.  3endix's  use  of  'cause*  as  a  metalinguistic  term  in 
his  semantic  analysis  of  'give'  is  typical  of  the  recent  practice  of  linguists 
in  that  the  meaning  he  defines  for  that  term  corresponds  close1 v  to  a  sense  of 
the  English  word  'cause'  (see  p.  29  of  ilj).  Under  these  circumstances  the 
attempt  to  make  the  conceptual  content  of  tne  metalinguistic  term  more  precise 
almost  inevitably  leads  into  an  investigation  of  the  conceptual  content  of  the 
corresponding  word  sense,  if  only  for  the  sake  of  differentiating  them  more 
sharply. 

As  with  a  great  many  of  the  metalinguistic  terms  used  by  linguists  for  semantic 
analysis,  the  conceptual  content  of  the  word  sense  corresponding  to  Bendix's 
'cause'  has  been  investigated  extensively  by  analytic  philosophers.  It  is 
noteworthy  in  this  connection  that  Bendix's  semantic  analysis  of  'give'  suffers 
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from  a  defect  that  is  highlighted  by  philosophical  analyses  of  ' cause'  and  the 
closely  related  notion  'intentioral  action'.  The  essence  of  Bendix's  analysis 
of  'give'  is  the  claim  that  'C  gives  B  to  A'  is  equivalent  to  'C  causes  (A  has 
13)*.  However,  it  is  certainly  false,  if  C  gives  B  to  A,  that  any  kind  of  causal 
connection  between  something  C  does  and  A's  subsequently  having  B  will  suffice. 
Clearly  what  C  must  do  is  perform  an  intentional  action  which,  C  anticipates, 
will  result  in  A's  having  B  (see  the  integrated  summary  of  conceptual  analyses 
of  ' intentional  action'  (1-103)  for  details).  In  other  words,  Bendix's  sanantic 
analysis  of  'give'  should  be  augmented  by  a  clause  containing  a  descriptive 
constant  (^metalinguistic  term)  that  invokes  a  notion  of  intentional  action. 

In  addition  to  making  it  easier  for  linguists  to  recognize  such  omissions  in 
their  semantic  descriptions,  our  integrated  summaries  of  conceptual  analyses 
will  give  them  a  head  start  in  working  out  adequate  definitions  of  the  descrip¬ 
tive  constants  they  decide  to  use.  The  complexity  illustrated  by  the  summary 
we  give  for  'intentional  action'  is  typical  of  the  conceptual  content  of  basic, 
commonplace  notions  into  which  many  word  meanings  seem  readily  decomposable. 

There  is  a  striking  parallel  between  the  ease  with  which  a  person  c^n  use  such 
notions  appropriately  in  thinking  and  communi eating  without  being  able  to  specify 
their  conceptual  content  in  ietail  and  the  ease  with  which  he  earn  utter  grammat¬ 
ically  correct  sentences  without  being  able  to  specify  their  grammatical 
structure  in  detail. 

3.2.1  Words  Analyzed  and  Sources 

Semantic  analyses  have  been  entered  for  the  50  words  following.  Since  s^s 
words  have  more  them  one  analysis,  approximately  70  analyses  have  been  completed 
for  these  50  words. 

all,  and, any,  assemble,  bear,  bring,  carry,  come,  conduct,  cross, 
descend,  disperse,  drive,  drop,  eight,  emerge,  enter,  escape, 
excuse,  five,  flow,  four,  get,  give,  go,  head,  lead,  leak,  may, 
move,  navigate,  nine,  part,  penetrate,  proceed,  quit,  sail,  scale, 
scatter,  separate,  seven,  six,  slip,  some,  start,  submerge,  take, 
three,  turn,  two 

The  analyses  are  taken  from  the  following  sources: 

Bendix,  Edward  (1966) .  Componential  analysis  of  General  Vocabulary:  The 
Semantic  Structure  of  a  Set  of  Verbs  in  English,  Hindi ,  and 
Japanese .  Bloomington,  Indiana:  Indiana  University  Press. 

Dixon,  R.  M.  W.  (1973).  "The  Semantics  of  Giving,  '  in  Tne  Formal  Analysis  of 
Natural  Languages.  Ed.  t>y  Gross,  Maurice  et  al.  The  Hague:  Mouton. 

Fillmore,  Charles  (1969).  "Verbs  of  Judging:  An  Exercise  in  Semantic 
Description,  '  in  Papers  in  Linguistics  1.1,  pp.  91-117. 
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Ikegami,  Yoshihiko  (1969).  The  Semiological  Structure  of  the  English  Verbs 
of  Motion. 


ackf nd"ff ,  Ray  Q971) .  "Modal  Structure  in  Semantic  Representation," 

Lin<fuistic  Inquiry  2.4,  pp.  479-514. 

Katz,  Jerrold  (1972).  Semantic  Theory.  New  York:  Harper  and  Row. 

Staal,  J.  F.  (1968).  "And,"  Journal  of  Linguistics,  Vol.  4,  pp.  79-81. 

It  should  be  emphasized  that  the  sources  do  not  reflect  a  bias  that  will  be 
maintained  in  the  project.  An  exceedingly  large  number  of  references  relevant 
to  the  EUR  words  have  been  found,  and  they  will  be  worked  through  systematically, 
beginning  with  those  containing  words  occurring  in  most  of  the  SDR  lexicons. 

3.2.2  Sample  Semantic  Analyses 

Four  semantic  analyses  have  been  ertered  into  the  archive  for  the  word  'give'. 

These  have  been  taken  from  three  separate  sources. 

SOLAR  Word:  GIVE 
Domain:  AP 

S"  ALT.  WORDS 
VPCA3 

senanal  #:  S-102 

Source:  3FNDIX.  FDWARD  \  19661 

Accompanying  Words:  GET,  TAKE,  LOSF,  KEEP,  LEND,  GET  RID  PF,  BORROW, 
FIND,  BE,  HAVE 

Author's  Sense:  NO  RESTRICTION  OF  THE  SEJSE  TS  INDICATED 
W7  Sense:  HOMOGRA  PH  1,  VT  (SENSES  1  -  V>) 

Prelim.  Oual.  :  BFVDIX  IS  CONCERNED  WITH  THE  PARADIGM  "C  GIVES  A  B".  H 
PROCEEDS  ON  THE  HYPOTHESIS  mHI>T  'GIVF'  SIGNIFIES  A  CflANGF  OF 
STATS  INVOLVING  'HAVING*  AND  THEREFORE  INCLODES  THE  COMPONEN 
•A  HAS  P  '. 

Components :  »C  CAUSES  (B  AN-PH  A)  '  =  *C  CAUSES  A  TO  HA7F  B*  =  'C  CADS 
(A  HAS  B)  » 

Sntailments;  (NOW)  A  HAS  B 

Informal  Explic.:  "...WHEN  A  SPEAK 2R  ASSERTS  'C  GIVES  A  B',  WE  CLilH  ! 

IS  ASSERTING  THAT  THERE  IS  A  CAfTSAL  CONNECTION  FROM  SOMETHIN! 
•C»  DOFS  OR  IS  ( « C  CAUSES'  =  •  (C  AN-*)  CAOSES3)  TO  THE  ASS  ER' 
•A  HAS  F  f P».  69-70  1 

nnal  OiMlif.:  "...THERE  IS  NO  COMPONENT  f  T  NOICATING  1  '"HAT  *C»  HAS  *B 
BEFORE  GIVING  IT  TO  'A'..."  fP^  701  "NOTE  ALSO  "HAT  THE 
DEFINITION  TS  NOT  PHRASED  AS  *C  CAUSFS  ( A  GETS  f  P.  701 
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30LAB  Comments:  THE  TERM  'NOW'  IS  DSED  BT  BENDIX  TO  INDICATE  THE  CHANGE 
OF  CTA7F  INVOLVING  •HAVING’.  THE  ENTAILMENT  FRAN  E  •  (NON)  A  HAS 
B*  IS  SUGGESTED  FOR  *C  GIVES  A  3'.  -HON*  IS  NOT  POSITED  AS  A 
DESCRIPTIVE  CONSTANT,  HOWEVER.  INSTEAD,  ’AT  TIME  T*  IS  DEFINED 
r  P.  6  31  AND,  FOR  ’GIVE',  'A  HAS  B  AFTER  TIME  T'  IS  USED.  *  P. 

76  1  THE  RESULT  IS  THAT  NO  PORMAL,  EXPLICIT  DESCRIPTION  OF  TIHE 
OR  TENSP  HAS  BEEN  GIVEN.  3ENDIX  ADMITS  AS  MUCH  ON  P.  77,  VIZ. 
"T  IS  IMPORTANT  TO  POINT  OUT  THAT  TENSES  HAVE  NOT  BEEN 
SPECIFICALLY  CONSIDERED  IN  THIS  STUDY.  A  FULLER  DESCRIPTION  OP 
A  LANGUAGE  WOULD,  OF  COURSE,  HAVE  TO  GIVE  A  SEMANTIC  ANALYSIS 
OP  THE  SYSTEM  OF  TENSES  AS  WELL  AS  STATE  THE  COMBINATORIAL 
SEMANTIC  RULES  THAT  ACCOUNT  POP  THE  TEMPORAL,  ASPECTUAL,  AND 
OTHER  INTERACTTONS  BETWEEN  THE  VERBAL  MEANINGS  AND  THE  TENSE 
SYSTEM."  A  MOR.  SERIOUS  DEFECT  IS  BhNDT  X  *  FAILURE  TO  PERCEIVE 
THAT  THF  CAUSAL  CONNECTION  BETWEEN  THE  ACTION  OF  THE  GIVER  AND 
THE  TRANSFER  OF  POSSESSION  TO  THE  RECIPIENT  MUST  INCLUDE  A 
SPECIFICATION  OP  THE  GIVER'S  INTENTION.  SEE  CONCEPTUAL 
A  TV  ALY  S  FS  1-101  ('CAUSE')  AND  1-101  (•  INTENTIONAL  ACTION').  AN 
ANALYSTS  CONTAINING  'HAS*  AS  A  COMPONENT  MUST  BE  PREPARED  TO 
FACF  tr E  COMPLEXITIES  ARISING  WHFN  ABSTRACT  OR  NON-PHYSICAL 
OBJECTS  ARF  "GIVEN".  THUS,  'HAS'  MUST  BE  GIVEN  A  NON-POSSESSIVE 
INTERPRETATION  IN  SENTENCES  SUCH  AS  THE  FOLLOWING.  "SHE  GAVE  ME 
A  SHY  SMILE"  (CF.  W7  DEFINITION  OF  SENSE  SB  OF  'GIVE':  "TO 
F-ESENT  *"0  VTFW  OR  OBSERVATION"). 


Serna nal  #:  S-16R 

Source:  DIXON,  R.M.S.  \  19731  "THE  SEMANTICS  OF  GIVING" 

AccouDa  uvinq  Words:  DONATE,  PRESENT,  AWARD,  PAY,  LFND,  SELL,  RENT 
Author's  Sense:  "r  THIS  FIRST  SENSE  OF  "GIVE"  INDICATES)  GIVING  IN  RETURN 
FOP  SOMF  SERVICE  OR  0B.1EC7,  NORMALLY  ACCORDING  TO  A  DEFINITE 
CONTRACT  OP  AGREEMENT.  THUS:  Til  FRED  GAVE  ^OM  A 2  FOR  CUTTING 
T”E  LAWN,  r  2 1  JONES  GAVE  HIS  MEN  SO*  EXTRA  ^O^  WORKING  ON 
SUNDAY.  Ml  SALLY  GAVE  JOHN  TFp  CLOCK  FOP  SS.  \  '4  1  JOHN  GAVE 
SALLY  AS  FOR  t'HE  CLOCK."  \  P.  211) 

K7  Sens":  VT  10A:  TO  ^1  ELD  POSSESSION  OF  BY  WAY  OF  EXCHANGE:  PAY 
Pred  icate-Arqs  :  /GIVE1/  r  SUBJECT  1  rOBJFC'"!  r^O  IN  DTP  FCT  0B.7ECT1  r  FDR 
PHPASE  PY°RESSIN  G  THE  CONTRACT) 

Informal  Explic.:  "NUCLEAR  VERB  'GIVE'  HAS  SYNTACTIC  CONNECTION  WITH 

STUBJ  ECT  1,  OTEJECTl,  TO  If  NDIF  SCT  1  OfRJECTl  OBLIGATORILY;  FOP 
F.  /C)’  H  A  NOE  1  OOTTONALLY."  TP.  22?1 

Informal  ExDlic. :  "IN  THE  PRESENCE  OF.. .A  FOF-PFR  AS  E,  THE  INDIPECT 

OBJECT  ¥  AY  PE  0"  ITTED:  .  .  .  JOHN  GAVE  SS  FOR  THE  BOOK.  ...'GIVE' 
CANNOT  OCCUR  WITHOUT  A  DIRECT  OBJECT;  WE  DO  NOT  GET:  JOHN  GAVE 
FOR  TUI  BOOK."  TP.  2131 


Soman  al  r :  S-  IBS 

Source:  DTXOS,  R.M.W.  r 19731  "TPE  SEMANTICS  OF  GIVING" 

Accomnan vino  Words:  DONATE.  PRESENT,  AWARD,  PAY,  LEND,  SELL,  RENT 
Author's  Sense:  "f THIS  SECOND  SENSE  OF  'GIVE'  INDICATES  1  SPONTANEOUS 
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GIVING,  WHICH  IS  NOT  IN  DIRECT  CONTRACTUAL  SETTLEMENT  OF  ANY 
OBLIGATION  (ALTHOUGH  I  HAY  OP  COURSE  BE  INDIRECTLY  MOTIVATED  B 
SOME  GOOD  DEED  THAT  THE  RECI7*  ENT  HAS  AT  SOME  TIME  PER  FOR'*  ED)  . 
THUS:  m  TOM  GAVE  $  100  TO  THE  HOSPITAL.  T&1  MARY  GAVE  EFFIE 
HF.P  OLD  FUR  COAT."  [  P.  2111 

W7  Sense:  VT  1:  TO  MAKE  A  PRESENT  OF 

2 A:  TO  GB ANT  OR  BESTOW  BY  FORMAL  ACTION 

3A:  TO  PUT  INTO  THE  POSSESSION  OF  ANOTHER  FOR  HIS  USE 

3C:  TO  COMMIT  TO  THW  TRUST  OR  KEEPING  OP  ANOTHER 

Predicato-Arqs:  /GIVE2/  (SUBJECT)  (OBJECT!  f  INDIRECT  OBJECT  1 

Informal  Exotic.  :  "IN  THE  SENSE  OF  SPONTANEOUS  GIVING,  ’GI VE2 ’  REQUIRES 
BOTH  DTPECT  AND  INDIRECT  OBJECTS;  NEITHER  CAN  BE  DELETED. 
ADTTTEnLY  THERE  ARE  SPECIAL  CIRCUMSTANCES  TN  WHICH  ONE  OR  BOTH 
OV  THESE  DOES  NOT  OCCUR  PER  SE  IN  SURFACE  STRUCTURE  — FOR 
INSTANCE  ’JOHN  ALWAYS  GIVES  GENEROUSLY  ON  FLAG  DAYS’;  BUT  NOTE 
THAT  IN  ’’HIS  SENTENCE  *  GENEROUSLY  *  GIVES  SOME  INFORMATION  ABOUT 
THE  DEEP  DTRPC"'  OBJECT  AND  ’ON  FLAG  DAYS’  ENABLES  ONE  TO  INFEF 
THE  NATUFE  OF  THE  INDIRECT  OBJECT.  OUR  nojNT  IS  THAT  GIVFN  ANY 
SIMPLE  SENTENCE  INVOLVING  ’  GT  VE2  ’  ,  NEITHER  DIRECT  NOR  INDIRECT 
OBJECT  CAN  AUTOMATICALLY  RE  DELETED."  (  °.  2131 

Semanal  *:  S-167 

Source:  KATZ.  J.  r  19721  SEMANTIC  THEoaY 

Accompmvinq  VorBs:  BUY,  SELL,  TRADE,  FXCHANGF,  SWAP,  RECEIVE,  LET, 

HIRE,  PENT,  LEND,  BORROW,  CHARTER 

Author’s  Sense:  "...A  "HOCESS  IB  WHICH  THE  POSSESSION  OF  SOMETHING  IS 
TTJAMSFE°PED  FROM  ONE  PERSON  TO  ANOTHER  WITHOUT  ANYTHING  BEING 
GIVEN  IN  RETURN.  ...(7.1761  T  A  1  JOHN  GAVE  THE  BOOK  TO  MARY"  (  P. 
3UH1 

W7  Sense:  VT  1:  Tn  MAKE  A  PPE5ENT  OF 

2A;  TO  GRANT  OP  bFSTOW  BY  FORMAL  ACTION 
3  A:  TO  nUT  INTO  THE  POSSESSION  OF  ANOTHER  FOR  HIS  USE 
3C:  tq  COMMIT  TO  THE  Tprjsr  OP  KFEPTNG  OF  ANOTHER 

Preliffl.  Oil  al.  ;  BY  REMOVING  FROM  THE  DEFINITION  OF  "SELL"  THE  PROCESS 

SEMANTIC  MARKER  WHICH  INDICATES  THE  SUM  JF  MONEY  FOR  WHICH  AN 
ITEM  IS  SOLD,  the  LEXICAL  READING  FOB  ''GIVF"  CAN  BE  OBTAINED. 

T  PARAPHRASE  OF  KATZ  ON  P.  248  1 

Predicate-Arqs  :  ’GIVE’;  (SUBJECT-DF)  (OBJBCT-OF)  (INDIRECT-ODJFCT-DF) 
(INFLFXIONAL) 

Cooponcn  ts :  ((CONDITION)  (POSSESSES  Y)  OF  X  AT  T-I) 

((CONDITION)  (POSSESSES  Y)  OF  Z  AT  T-J) 

coramnent.  Composition:  THE  PROCESS  OF  GIVING  TS  REPRESENTED  BY  A 

TRANS  IT  TON  FROM  THE  STATE  REPRESENTED  BY  THE  FIRST  COMPONENT  TO 
THE  STATE  REPRESENTED  BY  THE  SECOND  COMPONENT.  'X'  REPRESENTS 
THE  SUBJECT.  VY'  THE  OBJECT,  AND  ’Z’  THE  INDIPECT  OBJFCT.  'T-I* 
INDICATES  THF  INITIAL  STATE  AND  'T-J'  THE  TERMINAL  STATE. 

Select  ion  al  Features:  'Y'  =  <  (PHYSICAL  OBJECT) > 

•X'  =  <  (HUMAN )  6  { NOT-  IN  FA  N  T)  > 

•Z'  ~  <  (HUMAN)  S  (NOT-INFANT)  > 

S OLA p  Consents:  F^TZ'  DEFINITION  HAS  the  OUTSTANDING  QUALITY  OF 

PEP  RESENTING  THE  PROCESS  INVOLVED  IN  GIVING.  IT  IS  DEFICIENT  IN 
OTHER  z ES"ECTS,  HOWEVER.  FIRST,  IT  PAILS  TO  INDICATE  THAT  T"E 
SUBJECT  ('X')  INITIATES/C  A  USES  THE  PROCESS.  SECOND,  IT  DOES  NOT 
INDICATE  THAT  IN  THE  TERMINAL  STATE  (T-J)  THE  SUBJECT  (X)  NO 
LONGER  POSSESSES  THE  OBJECT  (Y) . 
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3.2.3  Explanatory  Notes  on  Descriptive  Constants 

To  aid  the  user  in  determining  the  significance  of  the  analyses  proposed,  the 
archive  includes  a  set  of  notes  explaining  as  precisely  as  possible  the  author's 
intention  in  using  a  particular  descriptive  constant  or  non-standard  symbol. 

The  notes  following  are  those  needed  for  the  semantic  analysis  of  'give'  by 
Bendix,  which  is  found  in  the  preceding  section. 


Constant:  CAUSE(S) 

Explan  t:  E-113 

Source:  BENDIX,  EDWARD  r  1 96b 1 

Definition:  "W”  USE  THE  LABEL  'CAUSE*  FOP  THE  COMPONENT  in  QUESTt0N 

HESF  VHFN  TESTS  INDICATE  THE  PRESENCE  OF  THE  COMPONENT  THAT 
•A'  (IP  E^C.J  DOES  OR  IS  SOMETHING  AND,  EUPTHEP,  THAT 

'’’HIS  ACTTCN  OR  STATE  OF  'A'  LEADS  TO,  RESULTS  IN,  OR  * AY 
OTHERWISE  BE  SAID  TO  3E  CONNECTED  WITH  ANOTHEP  ACTION  OR  STATE 
IN  A  WAY  THAT  WE  MAY  CALL  CAUSAL.  ULTIMATELY  WE  WOULD  WANT  TO 
DEFINF  THE  METALINGUISTIC  TERM  'CAUSF'  MOPE  PRECISELY.  THIS 
WOUin  Br  INTENDED  AS  A  DEFINITION  NOT  OF  CAUSATION,  BUT  OP  WHA T 
W*  CLAIM  A  SPEAKER  IS  ASSERTING  WHEN  HE  USES  A  PORM  WHOSE 
"EARING  WE  DEFINE  AS  CONTAINING  T HP  COMPONENT  •  CAUSE*  .*»  TP. 

63  1 

SOLAP  conments:  IT  APPFAPS  to  BE  T  H p  CASE  THAT  THE  CONSTANT  'CflUSE'  IS 
BEING  USED  TO  EXPRESS  THF  NOTION  'BRING  ABOUT  BY  INTENTIONAL 
ACTION*.  SEE  THE  SOLAR  COMMENTS  IN  SEMANTIC  ANALYSIS  S-102  AND 
THE  CONCEPTUAL  ANALYSES  0"  'CAUSE  (I-1U1)  AND  'INTENTIONAL 
ACTION*  (T -  10  3)  . 

Word  Analyzed:  GIVE,  TAKE 
Se natal  *:  S-102,  S-103 

Constant:  A-  (A»-) 

Exolan  *:  E-114 

Source:  BENDIX,  EDWARD  f  19661 

Definition:  "THE  EXISTENTIAL  QUANTIFIER  'THERE  IS  A. . . ;  IS  INDICATED  IN 
A  DEFINITION  BY  PREFIXING  THE  INDEPINIT*  ARTICLE  "A-  (AN-)** 
TO  THE  FIRST  OCCURRENCE  OF  THE  QUANTIFIED  VARIABLE  IN  THE 
DEFINITION."  TP.  14  1  "THUS,  THE  ABBREVIATED  NOTATION  *B  AN-R" 
A*  KAY  ALSO  BE  READ  *  THEPF  IS  A  RELATION  SUCH  THAT  R  IS  IN 
THE  RFLATION  TO  A  AND  THE  RELATION  IS  A  MEMBER  OF  THF 
SUBCLASS  OF  H-R  EL  ATION  S '  .  FURTHERMORE,  "A  (AN)"  WILL  RE 
PREFIXED  ONLY  TO  THE  FIRST  CCCURRENCE  OF  A  QUANTIFIED 
VARIABLF  IN  A  DEFINITION,  IN  IMITATION  OF  THE  OBJECT 
LANGUAGF.  IN  THE  INFORMAL  REPHR  AS  I NGS  OF  DEFINITIONS, 

"SOME"  WILL  OCCASIONALLY  BE  USED  FOP  THIS  "A  (AN)"  WHEN  A 
GIV^N  REPHRASING  APPEARS  THEREBY  TO  BE  REINDEX  £D  MORE 
NATURALLY  OR  FAITHFULLY. 

Word  Analyzed:  GIVE,  TAKE 
Senanal  •:  S-102,  S-103 
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Constant:  A ,  8,  C,... 

FxDlan  #:  E-115 

Source:  3ENDIX,  EDWARD  f  ,9661 

Definition:  THE  LETTERS  "A,  B,  C. . .  "  REPRESENT  VARIABLES  OR  THE 

SYNTACTIC  POSITIONS  OF  NOIJ*?  PHRASE.  IT  is  TO  SIHPLIPY  00R 
DESCRIPTION  THAT  KE  USE  THFSE  LETTERS  AS  SYNTACTIC  SYMBOLS 
RATHER  THAN  THE  MORE  USUAL  NOTATION  NP1,  RP2,  NP3,  ...OP  NP, 

N  P  * ,  NP* ' , . .. .  WHEN  THESE  LETTERS  APE  ENCLOSED  IN  SINGLE 
QUOTES,  EITHEP  ALONE  { E.  G.  ,  •  A  * )  OR  AS  PART  OP  A  LONGER  STPI 
(E.  G.  ,  *  A  HAS  B'  )  ,  THEY  REPRESENT  REFERENTS  OP  TOKENS  OF 
TYPES  OCCURRING  IN  THE  GIVEN  NOUN-PHRASE  POSITIONS...."  f 
131 

word  Analyzed:  GIVE,  TAKE 
Seaanal  *:  S-102,  S-103 

Constant;  HAS  (PHI 

Fxplan  *:  F-116 

Source:  BFNDIX,  EDWARD  f  1  966  1 

Definition:  "THE  BASIC  COMPONENT  OF  THE  MEANING  OF  'A  HAS  B '  HAS  BEEN 
DEFINED  AS  'THERE  IS  A  RELATION  BE^WFFN  A  AND  D' .  THE 
SUBCLASS  OF  RELATIONS  TO  WHICH  THIS  RELATION  MAY  BELONG  WAS 
DEFINED  FURTHER  IN  CHAPTER  3  IN  TERMS  OF  A  NUMBER  OF 
R  AP  APHR  AGING  CONSTRUCTIONS  OR  SOURCE-SENTENCE  TYPES.  9R  TEFL’ 
THEN,  USING  ' H*  TO  INDICATE  THE  RECIPROCAL  RELATION  OF  'HAVE' 
•A  HAS  P»-  'B  AN-RH  A'  (TO  BE  READ:  'B  IS  IN  AN  H-PELATION  Tf 
A'  OF  'THERE  IS  AN  H-R  ELATION  BETWEEN  B  AND  A')  WHERE:  'B 
AN-RH  A’  REPRESENTS  THE  CLASS  Op  PARAPHRASE  OR  SOURCE 
CONSTRUCTIONS  OF  ‘A  HAS  B'"  TP-  62  1 
Word  Analvzed:  GIVE,  TAKE 
Semanal  #:  S-102,  S-103 

Constant :  AN-F 

ExDlan  #:  E-118 

Source;  3ENDIX,  EDWARD  f  1966  1 

Definition:  "'  AF*  SYMBOLIZES  A  PUNCTTON  WITH  VAFTABLE  'A'.  WF  PLACE  T!!i 
SYfIROL  FOR  THE  VARIABLE  BEFORE  THE  SYMBOL  FOR  THE  FUNCTION, 
FATHER  "HaN  IN  THE  MORE  CUSTOMARY  ORDER  *  F ( X)  *  OR  «FX' ,  TO 
INDICATE  CORRESPONDENCE  WITH  THE  SUBJECT-PREDICATE 
CONSTRUCTION  (OR  WITH  THE  TOPIC-COMMENT  FORM)."  f?.  131  IN  TH 
INFOPMAI  EXPLICATION  OF  'GIVE',  'C  AN-F'  IS  TAKEN  AS  'C  DOES  C 
IS  SOMETHING',  f  P.  621 
Word  Analyzed:  GIVE 
Seaanal  •:  s-102 
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Constant:  R 

Exolan  #:  E— 119 

Source:  BENDIX,  EDWARD  f  19661 

Definition:  "’ARB*  ,  WITH  VARIABLES  *A»  AND  »  B* ,  STANDS  POR  A 

P  ELATION  BPTWEFN  *  A'  AND  'B».n  fP.  131  T  THE  DEFINITION  O?  }  »B 
AN-R  C’  riSl  (  '  B  IS  IN  SOKE  RELATION  TO  C'  OR  *  THERE  IS  A 
RELATION  BBTBEPN  B  AND  C‘—  I.E.,  THE  SUBCLASS  OF  RELATIONS  IS 

NOT  SPECIFIED)  "  TP.  621 
Word  Analyzed:  GIVE,  TAKE 
Semanal  #:  5-102,  S-103 

Constant:  ' 

Exolan  #  :  E- 1 20 

Source:  BENDTx,  EDWARD  r 19661 

Definition:  "SINGLE  000TES  ARE  "BED  HERF  TO  ENCLOSE  MEANINGS  OR 

FLFNFNTS  0?  BEARING  SOCH  AS  COBPONENTS,  DEFINITIONS,  GLOSSES, 
TRANSLATIONS,  OP  INTERPRETATIONS.  UNDERLINING  IS  FRED  GENTLY 
EMPLOYED  FDR  INDICATING  FOR  B  S,  SYNTACTIC  ELEMENTS,  OR  THE 
DEFT  NIENDDM  OF  A  DFFI  NTT  I  ON.  "  f  P.  13  1 
SOI.AR  comments:  Sinc<>  underlininq  is  not  possible  in  SOLAR,  we  have 
replace1  Rendix*  underline  with  sinqle  quotes  around  the 
underscored  word(s).  We  believe  no  siqnificant  ambiguities 
have  been  introduced  by  this  convention. 

Word  Analyzed:  GIVE,  TAKE 
Seaanal  #:  S-102,  S-103 

3.2.4  Conceptual  Analyses  of  the  Notions  'Cause',  ’Intentional  Action', 
and  'Responsible' 

The  analyses  of  'cause'  and  'intentional  action'  are  directly  relevant  to  Bendix's 
semantic  analysis  of  'give'  (see  S-102  in  Section  3.2.2).  The  analysis  of 
'responsible'  is  included  here  because  it  ha3  close  conceptual  connections  with 
the  other  two,  as  indicated  under  its  "qualifications"  heading.  Such  cross- 
references  will  subsequently  be  reinforced  by  more  detailed  cross-references 
within  the  analyses  proper  when  a  sufficiently  large  number  of  conceptual  analyses 
have  been  prepared  so  that  such  interconnections  can  be  set  up  in  a  systematic 
way. 
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Notion:  CAUSE 
Constants:  CAUSE,  CAU5E1 

Moris:  GIVE  fS-1021,  TAKE  rs-1031,  ORTV  E  T  S-129,  S-1301,  DROP  T  1  3  2  1 
Inteq.  Sum  #:  1-101 

Source:  LEWIS ,  DAVID  f  197.3]  "CAUSATION",  THE  JOURNAL  OF  PHILOSOPHY, 

PP. 

556-567. 

KTM,  JAESWON  r  1 973  1  "CAUSES  AND  COUNT ERFACTUALS",  THE  JOURNA 
OF  PHILOSOPHY,  PT.  570-572. 

LEWIS,  DAVID  f  1973  D  1  COU NT ERPACTU ALS  (BLACKWELL  AND  HARVARD) 
ANSCOMBE,  G.  L.  H.  T  19711  CAUSALITY  AND  DETER HI  NATION. 

CAMB RIDGE 

HACKIE,  J.  I.  f  19651  "CAUSES  AND  CONDITIONS"  AMERICAN 
PHILOSPHICAL  QUARTERLY,  PP.  2U5-264. 

WHITE,  HORTON,  FOUNDATIONS  OF  HISTORICAL  KNOWLEDGE  (HARPER  ANI 
HOW,  1965  1  PP.  14-104. 

W7  Senso:  1A:  SOHETHING  THAT  OCCASIONS  OR  EFFECTS  A  RESULT 
Qualifications:  THE  NOTION  OF  •CAUSE*  ANALYZED  HERE  IS  THAT  OF  ONF 

PARTICULAR  EVENT  BEING  A  CAUSE  OF  ANOTHER  PARTICULAR  EVENT. 

Anal  irsis:  "IK  "HIS  CASE... THE  SO  CALLED  CAUSE  IS,  AND  IS  KNOWN  TO  BE,  A  A 
INSUFFICIENT  BUT  NECESSARY  PART  0F  A  CONDITION  WHICH  IS  ITSELF 
UNNECESSARY  BUT  SUFFICIENT  FOR  THE  RESULT."  (  HACKIE,  J.  L. 
19661.  LEWIS  STATES  THE  PEGULARTTY  ANALYSES  OF  CAUSATION  AS 
FOLLOWS:  "LET  C*  BE  THE  PROPOSITION  THAT  C  EXISTS  (OR  OCCURS) 
AND  LET  E*  3F  TK  E  PROPOSITION  THAT  F  EXISTS.  THEN  C  CAUSES  E, 
ACCORDING  TO  A  TYPICAL  REGULAR  ITT  ANALYSIS,  IF  Til  C*  AND  E* 

APE  TRIJF  ;  AND  f?l  FOR  SOME  NON-EHPTY  SET  L.«  OF  TRUE 
L AW- PROPOSITIONS  AND  SOHE  SET  F$  OF  TRUE  PROPOSITIONS  OF 
PART ICUT  AF  FACT,  IS  AND  FS  JOINTLY  DO  NOT  IHPLY  E*,  AND  *S 
ALONE  DOES  NOT  IHPLY  C* — >E*. ...  I'1’  REMAINS  TO  BE  SEEN  WHETHER 

ANY  REGULARITY  ANALYSIS  CAN  SUCCEED  IN  DISTINGUISHING  GENUINE 
CAUSES  FPOH  EFFECTS,  EPIPHENOHENA,  AND  PPEFMPTED  POTENTIAL 
CAUSES--AND  WHETHER  TT  CAN  SUCCEED  WITHOUT  FALLING  VICTIH  TO 
WORSE  PROBLEMS,  WITHOUT  PILING  ON  THE  EPICYCLES,  AND  WITHOUT 
DEPARTING  PPOH  THE  FUNDAMENTAL  IDEA  THAT  CAUSATION  IS 
INSTANTIATION  OF  PEGULAPirTES.  "  (LEWIS  (19731  PP.  556-557;  TN 
TRANSCRIBING  THIS  PASSAGE  WE  USED  A  ’S'  ADSCRIPT  TO  INDICATE 
SCR-rpT  CRPITAL  LETTERS  TN  THE  ORIGINAL,  AND  A  •*•  ADSCRIPT  TO 
INDICATE  BLOCK  CAPITALS  .)  LEWIS  THEN  GOES  ON  TO  GIVE  A 
MODIFIED  SINE  QUA  NON  ANALYSIS:  "CAUSAL  DEPENDENCE  AfPNG  ACTUAL 
FV  ENTS  IMPLIES  CAUSATION.  IF  C  AND  E  ARE  TWO  ACTUAL  EVENTS 
SUCH  THAT  E  WOULD  NOT  HAVE  OCCURRED  WITHOUT  C,  THEN  C  IS  A 
CAUSE  OF  E.  BUT  I  PEJErT  THE  CONVERSE.  CAUSATION  MUST  ALWAYS 
BE  TRANSITIVE;  CAUSAL  DEPENDENCE  DAY  NOT  BF;  SO  THERE  CAN  BE 
CAUSATION  WITHOUT  CAUSAL  DFPENDSNCF .  LET  C,  D,  AND  F  BE  THPFE 
ACTUAL  F VE NTS  SUCH  THAT  D  WOULD  NOT  HAVE  OCCURRED  WITHOUT  C  AND 
E  WOULD  NOT  HAVE  OCCUP"FD  WITHOUT  D.  THEN  C  IS  A  CAUSE  OF  E 
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EVES  IF  E  WOULD  STILL  BATE  OCCURRED  (OTHERWISE  CATSED)  WITHOUT 
C.  WE  EXTEND  CAUSAL  DEPENDENCE  TO  A  TRANSITIVE  RELATION  IN  THE 
USUAL  WAY.  LET  C,  D,  E ,  ...  BE  A  FINITE  SEOnENCE  OP  ACTUAL 
PARTICULAR  EVENTS  SUCH  THAT  D  DEPENDS  CAUSALLY  ON  C,  E  ON  D, 

AND  SO  ON  THROUGHOUT.  THEN  THIS  SEQUENCE  IS  A  CAUSAL  CHAIN. 
FINALLY,  ONE  FVENT  IS  A  CAUSE  0?  ANOTHER  IFF  "'HERE  BUSTS  A 
CAUSAL  CHAIN  LEADING  FROH  THE  FIRST  TO  THR  SECOND.  THIS 
COMPLETES  MY  COUNTER? ACTU AL  ANALYSIS  OF  CAUSAL  TON."  LEWIS 
r 1973 1  ».  563. 

Critique:  FOP  CRITICISM  OF  THE  LEWIS  ANALYSIS,  SEE  KIM  r  19731  .  THE 
WORST  CASE  FOR  LEWIS  IS  THAT  OF  AN  EVENT  B  THAT  IS  A  SUBEVENT 
OF  AN  EVENT  A.  FOR  A  SOPHISTICATED  REGULARITY  ANALYSIS,  SEE 
MACKIE  (19651.  FOR  ANOTHER  DISCUSSION  OF  THE  REGULARITY  THEORY 
PLUS  A  DISCUSSION  OP  HOW  TO  ANALYSE  'THE  CAUSE  OF'  SEE  WRITE 
r 19651  •  FOR  DISCUSSION  OF  THE  NEED  TO  ALLOW  CAUSATION  UNDER 
INDETERMINISM  SEE  ANSCOMBE  (1971  1  . 

Notion:  INTENTIONAL  ACTION 
Constants:  cause,  CAUSE1 

words:  GIVE  rs-1021,  '’’AKE  rs-1031,  DRIVE  (S-129,  S-1301,  DROP  TS-1321 
Senanal  #:  S-107,  S-103,  S-129,  S-130,  S-132 
Inteq.  Su«.  *:  1-103 

Source:  ARMSTRONG,  D.  1.  r  19711  "ACTING  AND  TRYING",  UNPUBLISHED 
PRESIDENTIAL  ADDRESS  AUSTRALASIAN  ASSOCIATION  OP 
PHILOSOPHY. 

CHISHOLM,  R.  M .  "THE  DESCRIPTIVE  ELEMFNT  IN  THE  CONCERT  OP 
ACTION  ",  JOURNAL  OF  PHILOSOPHY,  1964,  PP.  614-625. 
CHISHOLM,  R.  M.  [  19701  "THE  STRUCTURE  OF  INTENTION",  JOURNAL 
OF  PHILOSOPHY,  PP.  633-647. 

DAVTDSON,  0.  (19631  "ACTIONS,  REASONS,  AND  CAUSES",  JOURNAL 

OF  PHILOSOPHY,  PP.  685-700. 

OLOENOUIST,  ANDREW  T  19671  "CHOOSING,  DECIDING,  AND  DOING"  IN 
THE  ENCYCLOPEDIA  OF  PHILOSOPHY,  2,  PP.  96-104. 

W7  Sense:  2:  THE  BRINGING  ABOUT  OF  AN  ALTERATION  BY  FORCE  OR  THROUGH  A 
NATURAL  AGENCY 

Qualifications:  THE  NOTION  OF  'INTENTIONAL  ACTION'  ANALYZED  HERE 

PRESUPPOSES  THE  NOTIONS  OF  'TRYING',  'ACTION'  AND  'CAUSF'.  IT 
IS  CLOSELY  LINKED  TO  SSCH  OTHER  NOTIONS  AS  'INTENTIONS',  'GOAL' 
A  NL'  'RESPONSIBLE'. 
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Aoaltsis:  IN  "ACTING  AND  TRYING",  D.  M.  ARMSTRONG  GIVES  THE  POL  LON  IN  G 
ANi.IYSIS  OF  "A  INTENTIONALLY  PERPORMS  ACTION  P":  A  DID  P 
INTENTIONALLY  IF  AND  ONLY  IF  f 1  1  A  DID  P  (21  A  TRIED  TO  DO  P 
Ml  A'S  TRYING  TO  DO  P  CADSED  A  TO  DO  T.  AS  ARMSTRONG  NOTES  (1 
1«)  ,  THTS  DEFINITION  IS  STILL  TOO  WEAK:  MOVEMENTS  NOT 
INTU  TTTV  FLY  CONSIDERED  INTENTIONAL  ACTIONS  SATISFY  MW  31.  A* 
EXAMPLE  DASEP  ON  AN  INSIGHT  OP  CHISHOIM,  "THE  DESCRIPTIVE 
FLEMENT  IN  7HE  CONCEPT  OF  ACTION",  SHOWr  WHY.  ARMSTRONG  PUTS  ] 
AS  FOLLOWS:  "A  SETS  OUT  ACROSS  TOWN  WITH  THE  OBJECT  OF  XILLI* 
HTS  MNCI.E.  THE  EXCITEMENT  GENFRATFP  BY  THIS  PLAN  CAUSES  HIM  TO 
DP IV  F  FAST  AND  CARELESSLY.  AS  A  RESULT  HE  KNOCKS  DOWN  AND 
KTLLS  A  PEDESTRIAN  WHOM  HE  DOES  NOT  SFF.  TT  TURNS  OUT  TO  BE  A* 
UNCLE.  A'S  ATTEMPT  TO  MURDER  HIS  UNCLE  BPOUGHT  IT  ABOUT  THAT 
HE  KILLED  HTS  HNCLE.  BUT  HIS  KILLING  WAS  NOT  INTENTIONAL." 
ARMSTRONG  GOES  ON  TO  SUGGEST  THAT  TR t  A” A LYSIS  NEEDS  TO  BE 
FILLED  OUT  SO  THAT  A'S  TRYING  TO  DO  »  BRINGS  ABOUT  P  ACCORDING 
TO  A  CAUSAL  PATTERN  INVOLVING  A'S  PURPOSES  AND  BELIEFS  THAT  CA 
BE  SHADOWED  OUT  IN  A  TRAIN  OF  PRACTICAL  REASONINGS.  WE  CAN 
FOP  MA  LT  7.c  THIS  AS  FOLLOWS:  A  DOSS  P  IN7  ENT  ION  ALLY  IF  AND  ONLY 
IF  r  11  A  DOES  P  BY  DOING  Q  F  2]  A  TRIES  TO  DO  P  BY  DOING  Q  T  3  1 
A'S  TRYING  T0  DO  P  BY  DOING  Q  CAUSES  A  TO  DO  Q  f  4 1  1 1  A' S  DO  IN 
0  CAUSES  EVENT  E,  AND  A'S  DOING  P  CONSISTS  IN  CAUSING  E,  THEN 
A'S  DOING  0  CAUSES  E  ACCORDING  ^O  A  CAUSAL  CHAIN  THAT  WAS 
PART  I  ALLY  ANTICIPATED  BY  A,  AND  A'S  PARTIALLY  ANTICIPATING  C 
CAUSED  A  TO  DO  (OR  TRY  TO  DO)  P  BY  DOING  0.  (THIS  IS  TO  BE 
UNDERSTOOD  AS  INCLUDING  THE  POSSIBILITY  THAT  Q  IS  P)  .  IN  THE 
ABOVE  FV AMPLE,  IT  SHOULD  BE  CLEAR  THAT  THOUGH  A  KILLED  HIS 
UNCLE  P.Y  HITTING  HIM  WITH  A  CAR,  CLAUSES  (  2  ]— f  4 1  FAIL  SINCE  A 
DIDN'T  TRY  TO  KILL  HIS  UNCLE  BY  HITTING  HIM  KITH  A  CAP. 

SUPPOSE,  HOWEVER,  THAT  (11  A  INTENTIONALLY  KILLS  HIS  UNCLE  BY 
SHOOTING  A  GUN.  THEN  PRESUMABLY,  (  2  1  HE  WILL  HAVE  T  PI  ED  TO 
KILL  TH ?  UNCLE  BY  SHOOTING  THE  GUN.  AND  Ml  THAT  ATTEMPT  TO 
KILL  HIS  UNCLE  BY  SHOOTING  THE  GUN  WILL  BE  THE  REASON  FOF  (AND 
HENCE,  CAUSE  OF)  HIS  SHOOTING  THE  GUN.  BUT  A'S  SHOOTING  THE  GUI 
CAUSES  HIS  UNCI.E  TO  DIE  AND  A'S  KILLING  HIS  UNCLE  CONSISTS  IN 
A'S  CAUSING  HIS  UNCLE  TO  DIE,  SO  IF  (41  IS  TO  BE  SATISFIED  T HE! 
A'S  SHOOTING  THE  GUN  MUST  CAUSE  HIS  UNCLE  *0  DIF  ACCORDING  TO  J! 
CAUSAL  CHAIN  C  THAT  WAS  PAP TIA LI.Y  ANTICIPATED  BY  A  (NAMELY,  TH1 
BULLET  HITTING  THE  U NrLE  AND  CAUSING  DEATH),  AND  A'S 
ANTICIPATING  THAT  CAUSAL  CHAIN  MUST  BF  A  REASON  FOR  (AND  HENCE 
A  CAUSE  OF)  A'S  KILLING  THE  HNCLE  BY  SHOOTING  THE  GUN.  THAT  HOI 
JUST  ANY  CAUSAL  CHAIN  FROM  SHOOTING  THE  GUN  TO  THE  UNCLE'S 
DYING  IS  SUFFICIENT  FOR  A  TO  HAVE  INTENTIONALLY  KILLED  HIS 
UNCLE  CAN  BF  SEEN  BY  CONSIDERING  THE  POSSIBILITY  THAT  THE 
BULLET  MISSES  BUT  THE  EPISODE  IS  SO  UNNERVING  TO  A'S  UNCLE  THAT 
HE  HAS  A  HEART  ATTACK  AND  DIES.  IF  A  HAS  INTENDED  THAT  HE  KILL 
HIS  UNCLE  BY  THE  BULLET  PIERCING  THE  UNCLE'S  EODY,  THEN,  IN 
THIS  CASE  IT  SEEMS  THAT  A  DID  NOT  INTENTIONALLY  KILL  HIS  UNCLE. 
IF.  ON  THE  OTHER  HAND,  A  HAD  THOUGHT, "WELL,  EVEN  IF  THE  BULLET 
MISSES  MAYBE  THE  OLD  GEEZER  WILL  HAVE  A  HEART  ATTACK  AND  DIE", 
THEN  A  WOULD  HAVE  INTENTIONALLY  KILLED  HIS  UNCLB. 
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Critique:  THERE  ARE  STILL  SONS  DIFFICULTIES  HITH  THIS  ANALYSIS.  THE 
PRACTICAL  REASONING  DR  CAUSAL  CHAIN  ANTICIPATED  BY  A  COULD 
PERHAPS  BE  WRONG  IN  SOME  DETAILS  (WHILE  SUBSTANTIALLY  CORRECT) 
IN  A  CASE  OF  INTENTIONAL  ACTION,  THOUGH  THIS  ANALYSIS  DOESN'T 
DFAL  WITH  SUCH  CASES.  IT  SHOULD  BE  NOTED  THAT  THE  ANALYSIS 
AUTOMATICALLY  YIELDS  A  NOTION  OF  'AGENT  OF  INTENTIONAL  ACTION': 
A  "IS  THE  AGENT  OF"  THE  INTENTIONAL  ACTION  P  IF  AND  ONLY  IF  A 
IS  THE  ON*  WHO  DOES  P  INTENTIONALLY.  FOR  A  DEFENSE  OF  REASONS 
AND  INTENTIONS  AS  CAUSES  OP  ACTIONS,  SEE  DAVIDSON  f  19631.  FOR 
"ORE  ON  'INTENTION',  SEV.’  CHISHOLM  f  19701.  A  SURVEY  ARTICLE, 
DEALING  WITH  RELATIONS  BETWEEN  ACTION,  CHOICE  AND  DECISION  IS 
OL DEN OU 1ST,  f  19691. 

Notion:  RESPONSIBLE 
Constants:  RESPONSIBLE 
words:  EXCUSE  rs-1011 
Intpq.  Sum  #:  1-10  2 

Source:  CHISHOLM,  R.  (  1967  1  ‘"HF  COULD  HAVE  DONE  OTHERWISE'",  THE 
JOURNAL  OP  PHILOSOPHY,  PP.  409-417. 

E  RA  NKFOPT,  HARRY  (  19691  "ALTERNATE  POSSIBILITIES  AND  MORAL 
RFSPONSTBI LITY",  THE  JOURNAL  OF  PHILOSOPHY,  DECEMBER  4, 
PP.  829-839. 

FeA NKFUCT,  HARRY  (19711  "FREEDOM  OF  THE  WILL  AND  THE  CONCEPT 
OF  A  PERSON".  THE  JOURNAL  O*  PHILOSOPHY,  JANUARY  14,  PP. 
5-20. 

HOI.  BO  FOB ,  L.  C.  T  19721  "BT.AME,  PRAISE,  AND  CREDIT", 

PROCEEDINGS  OF  THE  ARTSTOTELIAN  SOCIETY,  1971-72,  PP. 

8C- 100. 

HOOK,  SIDNEY  (  1966  1  "NECESSITY,  INDETERMINISM,  AND 

SENTIMENTALISM",  IN  FREE  UILL  AND  DETERMINISM  (ED.  BY  B. 
3ER0ESKY  )  .  HARPER  6  ROW,  V.Y.,  PP.  76-45. 

KAUFMAN,  ARNOLD  (19671  "RESPONSIBILITY,  MORAL  AND  LEGAL",  IN 
THE  ENCYCLOPEDIA  OF  PHILOSOPHY  (ED.  BY  P.  EDWARDS).  THE 
*ACMTLLAN  CO.  f.  THE  FREE  PRESS,  NEW  YORK,  PP.  183-188. 
PASKE,  GF'IALD  (  1970  1  "RESPONSIBILITY  AND  THE  I MCOMPAP ABILITY 
PRINCIPLES",  THE  PER  SONA  LI  ST,  *P.  477-435. 

W7  some:  1A:  "LIABLE  TO  BE  CALLED  UPON  TO  ANSWER  AS  ’’HE  PRIMARY  CAUSE, 
MOT  IV  F,  OR  AGENT". 

Oualif  icat  ions:  THE  NOTION  OF  'RESPONSIBLE'  ANALYZED  HERE  HAS  CLOSE 
CONCEPTUAL  CONNECTIONS  WITH  NUMEPOUS  OTHER  PHILOSOPHICALLY 
CENTPAI.  NOTIONS,  E.G.  ,  'INTENTION',  'INTENTIONAL  ACTION", 
•ACTION',  'CONSCIOUSNESS',  'RATIONALITY',  'FREEDOM',  'CAUSE', 
•FREEWILL',  'FREE  AGENT',  'AGENT',  'DETERMINISM',  AND 
•JUSTICE';  THE  ENTRIES  FOR  THESE  NOTIONS  SHOULD  BE  CONSULTED. 
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Analysis:  "IF  WE  MAKE  A  LIST  OF  THE  CIRCUMSTANCES  BEHIND  ACTIONS  FOB 

WHICH  WE  HOLD  INDIVIDUALS  RESPONSIBLE  AND  THOSE  POR  WHICH  WE  DO 
NOT,  WE  SHALL  FIND  THAT  AS  A  RULE  THE  FIRST  CLASS  CONSISTS  OF 
THOSE  IN  WHICH  PRAISE  AND  REWARD,  BLAMF  AND  PUNISHMENT,  TEND  TO 
INFLUENCE  THE  FUTURE  CONDUCT  OF  THOSE  INVOLVED  AND/OR  THOSE 
TFKPTED.  THIS  IS  NOT  THE  WHOLF  STORY....  THE  BEHAVIOR  OF 
1 1  FANTS ...  IS  MODIFIABLE  BT  APPROPRIATE  REWARD  AND  PU NI5HHENF 
EVEN  THOHSH  WE  DO  NOT  HOLD  THEM  MORALLY  RESPONSIBLE.  BUT  AS 
THE  AGE  OF  RATIONALITY  APPROACHES,  WE  GRADUALLY  DO.  THIS 
SUGGESTS  THAT  IN  ADDITION  TO  SUSCEPTIBILITY  TO  REWARD  AND 
PUNISHrfFNT,  WE  ATTRIBU'/fi  RESPONSIBILITY  WHERE  THERE  IS  A 
TENDENCY  TO  RESPOND  TO  VALID  REASONS,  TO  BFHAVE  RATIONALLY,  TO 
RESPOND  TO  HUMAN  EMOTIONS  IN  A  HUMAN  WAY.  PERHAPS  A  THIRD 
ELEMENT  INVOLVED  IN  THE  ATTRIBUTION  OF  MORAI  RESPONSIBILITY  to 
VOLUNTARY  ACTION  IS  THE  ASSUMPTION  THAT  VOLUNTARY  ACTION  IS 
APPROVED  ACTION.  A  MAN  IS  MORALLY  RESPONSIBLE  FOR  AN  ACTION  HE 
COMMITS  TO  THE  EXTENT  THAT  HE  APPROVES  OF  IT.  IF  HE  SINCERELY 
DISAPPROVES  OF  HIS  ACTION,  REGARDS  IT  AS  WRONG  AND  CONDEMNS  IT 
AS  WRONG,  BUT  STILL  COMMITS  IT,  WE  TEND  TO  REGARD  HIM  AS  ILL, 

AS  ACTING  UNDER  "COMPULSION".  IT  IS  SOME  SUCH  CONSDER ATIONS  AS 
THIS  THAT  LIES  BEHIND  OUR  EXTENUATION  OF  CERTAIN  KINDS  OF 
APPARENTLY  VOLUNTARY  ACTION  (AS  WHEN  WE  SAY,  'HE  DIDN’T  MEAN  TO 
DO  rrn,  ESPECIALLY  WHERE  IGNORANCE  IS  PRESENT"  HOOK,  SIDNEY 
r  10661,  pp.  47-48 .  THP.  RELATIONS  HIP  BETWEEN  COERCION  AND  MORAL 
RESPONSIBILITY  HAS  RECENTLY  BEEN  CLARIFIED  BY  FRANKFURT.  MOST 
PHILOSOPHERS  HAVE  AGREED  THAT  WHAT  FRANK FUPT  CALLS  "THE 
PRINCIPLE  OF  ALTERNATE  POSSIBILITIES"  IS  TRUE,  VIZ.,  THAT  A 
PERSON  IS  MORALLY  RESPONSIBLE  FOR  WHAT  HE  HAS  DONE  ONLY  IF  HE 
COIJND  HAVE  DON  F  OTHERWISE.  HOWEVER,  FRANKFURT  ADDUCES  EXAMPLES 
THAT  SHOW  CONCLUSIVELY  THAT  THE  PRINCIPLE  OF  ALTERNATE 
POSSIBILITIES  IS  FALSE.  HE  OBSERVES,  "THE  PACT  THAT  A  PERSON 
COULD  NOT  HAVE  AVOIDED  DOING  SOMETHING  IS  A  SUFFICIENT 
CONDITION  OF  HTS  HAVING  DONE  IT.  BUT,  AS  SOME  OF  MY  EXAMPLES 
SHOW,  THIS  FACT  MAY  PLAY  NO  ROLE  WUATFVFP  TN  THE  EX°LANATION  OF 
WHY  HE  DID  IT.  IF  SOMEONE  HAD  NO  ALTERNATIVE  TO  PERFORMING  A 
CFPTATN  ACTION  BUT  DID  NOT  PERFORM  IT  BECAUSE  HE  WAS  UNABLF.  TO 
DO  o-n^PWISF,  HE  WOULD  HAVF  PERFORMED  EXACTLY  THE  SAME  ACTION 
VV  EN  IF  HE  COULD  HAVE  DONE  OTHF^WTSE.  THE  FOLLOWING  MAY  ALL  BE 
TRUE:  THERE  WERE  CT  RCUMST  A  NCPS  T  HAT  MADE  IT  IMPOSSIBLE  FOR  A 

PERSON  TO  AVOID  DOING  SOMETHING;  THESE  CIRCU HSTA NCES  ACTUALLY 
PLAYED  A  17 OLE  IN  BRINGING  TT  ABOUT  THAT  HE  DTD  IT,  SO  THAT  IT 
IS  CORRECT  TO  SAY  THAT  HE  DID  I?  BECAUSE  HE  COULD  NOT  HAVE  DONE 
OTHERWISE;  THE  PERSON  REALLY  W  ANT  FD  TO  DO  WHAT  HE  DTD;  HE  DTD 
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IT  BECAUSE  IT  MAS  MHAT  HE  REALLY  WANTED  TO  DO,  SO  THAT  IT  IS 
NOT  CORRECT  TO  SAY  THAT  HE  DID  WHAT  HF  DTD  ONLY  BECAUSE  HE 
COULD  NOT  HAVE  DONE  OTHERWISE.  UNDER  THESE  CONDITIONS,  THE 
PERSON  MAY  BE  MORALLY  RESPONSIBLE  FOR  WHAT  HE  HAS  DONE.  ON  THE 
OTHER  HAND,  HE  WILL  NOT  BE  MORALLY  RESPONSIBLE  FOR  MHAT  HE  HAS 
DONE  IF  HE  DID  IT  ONLY  BECAUSE  HE  COULD  NOT  HAVE  DONE 
OTHERWISE,  EVEN  IF  WHAT  HE  DID  WAS  SOMETHING  HE  REALLY  WANTED 
TO  DO.  ”  FRANKFURT,  HARRY  T  1969  1 

criti 3U" :  HOOK'S  CLAIM  THAT  "A  MAN  IS  MORALLY  RESPONSIBLE  FOR  AN  ACTION 
HE  COMMITS  TO  THE  EXTENT  THAT  HE  APPROVES  OF  IT”  IS  SUSPECT,  AT 
LEAST  PENDING  SOME  FURTHER  EXPLANATION  OF  'APPROVAL'.  IF  HOOK 
IS  USING  'APPROVAL*  IN  A  NON- TECHNICAL  SENSE,  THEN  IT  WILL 
FOLLOW  THAT  ONE  CJP'  EVADE  RESPONSIBILITY  FOR  ONE'S  ACTIONS 
MERELY  BY  DISAPPROVING  OF  THEM.  THE  TRUTH  THAT  IS  BEHIND 
HOOK'S  POSITION  IS  TH  AT  DISAPPROVAL  TS  EVIDENCE  THAT  THE  ACTION 
IS  NOT  REALLY  ONE'S  OWN.  THE  -COMPULSION”  OF  WHICH  HOOK  SPEAKS 
CAN  POSSIBLY  BE  EXPLAINED  VIA  A  SPLITTING  UP  OF  THE  INDIVIDUAL 
TNTO  PARTS,  SOME  ONE  OF  WHICH--THE  ONE  WITH  WHICH  THE  SEIF  IS 
IDENTIFIED — BEING  the  PARI  WHICH  DISAPPROVES  OF  THE  ACTION.  A 
POSSIBLE  APPROACH  IS  SKETCHED  IN  FRANKFURT  \ 1 97 1 1  WHERE  A 
DISTINCTION  BFTWEEN  FIRST-ORDER  VOLITIONS  AND  SECOND-ORDER 
VOLITIONS  (VOLITIONS  ABOUT  FIPST-ORDER  VOLITIONS)  IS  PUT  TO 
GOOD  USE.  FRANKFURT  MENTIONS  BUT  DOES  NOT  GO  INTO  THE 
COMPLEXITIES  OF,  HIGHER  THAN  SECOND-ORDER  VOLITIONS.  A  DETAILED 
ANALYSIS  OF  ‘HE  COULD  HAVE  DONE  OTHERWISE*  is  GIVEN  BY  CHISHOLM 
r  1967)  AND  AN  INTERESTING  TREATMENT  OF  THF  RELATION  BFTWEEN 
RESPONSIBILITY  AND  RATIONALITY  IS  GIVE  BY  PASKE  \ 19701  . 
RELEVANT  TO  FILLMORE'S  USE  OF  'RESPONSIBLE  (SEE  S-101)  BUT  TOO 
DETAILED  to  BE  INCLUDED  HERE,  TS  HOLBOFOW  T  1 97?  ]  .  A  SUMMARY 
OF  THE  F  EAL  T ION  SHIP  BETWEEN  MORAL  AND  LEGAL  RESPONSIBILITY  IS 
GIVEN  BY  KAUFMAN  (19671  . 


3.2.5  Other  Activities 


The  bibliographic  file  and  the  word  index  have  been  implemented  in  a  data 
management  system,  and  listings  have  been  produced  periodically. 

In  the  area  of  program  development,  we  have  completed  writing, compiling,  and 
debugging  the  programs  needed  to  restructure  the  W7  parsed  transcripts  into  a 
format  suitable  for  input  to  other  programs  that  will  produce  the  various  machine- 
derived  files.  One  of  these  latter  programs  (the  one  building  the  collocational 
feature  file)  has  been  written,  compiled,  and  debugged. 

We  have  publicized  the  archive  throughout  the  US,  Canada,  Australia,  and  Europe. 
Approximately  20  researchers  have  responded  to  our  solicitation  of  documents 
dealing  with  lexical  semantics,  and  about  35  have  expressed  interest  in  receiving 
data  from  the  archive. 
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3.3  PLANS 

During  the  remainder  of  this  contract  year,  LDA  will  be  focusing  on  five  tasks: 

1)  Continue  data  collection  from  the  literature.  This  will  involve 
extending  the  bibliographic  file  by  about  700  entries,  updating 
the  word  index,  and  more  than  doubling  the  semantic  and  conceptual 
analysis  files. 

2)  Continue  to  develop  the  large  number  of  programs  needed  to  pr»  duce 
the  three  remaining  machine-derived  files. 

3)  Put  through  production  runs  to  build  the  machine-derived  files. 

4)  Disseminate  data  from  each  of  the  files. 

5)  Document  the  archive  as  it  comes  into  existence. 

During  the  next  contract  year,  emphasis  will  be  placed  on  updating  and 
improving  each  of  the  files,  adding  antonymi tive  relations  to  the  archive , 
and  facilitating  distribution  of  data  from  the  archive.  The  updating  activity 
derives  from  the  continual  addition  of  words  to  the  SUR  lexicons.  Improvements 
include  restructuring  based  on  feedback  from  users  regarding  the  utility  of  eacn 
file.  Antonymi tiva  relationships  will  be  added  to  the  semantic  field  file  to 
permit  their  incorporation  into  the  semantic  networks  of  the  SUR  systems.  To 
facilitate  use  of  the  archive,  the  project  will  produce  user's  guides,  provide 
demonstrations  of  the  use  of  SOLAR,  and  build  a  user  data-management  interface 
program  to  guide  the  user  in  performing  on-line  retrieval. 

3.4  STAFF 

Dr.  T.  C.  Diller,  Project  Leader 
J.  Olney  (Consultant) 

F.  Heath  (part-time) 

3.5  DOCUMENTATION  AND  PUBLICATIONS 

[1]  T.  Diller,  and  J.  Olney,  "SOLAR:  A  Semantically-Oriented  Lexical  Archive," 
SDC  SP-3726 ,  NIC  19927,  SUR  Note  110,  November  1973. 

[2]  T.  Diller,  "SOLAR  Bibliography  User's  Guide,"  SDC  TM-5292,  in  press. 


30  April  1974 


34 


System  Development  Corporation 
TM-  5  2 ' 3/001/00 


4.  COMMON  INFORMATION  STRUCTURES 

4.1  INTRODUCTION 

The  Common  Information  Structures  project  is  addressing  the  problem  of  converting 
and  transf erring  data  bases  among  disparate  data  management  systems  (DMSs) .  The 
need  to  share  data  for  different  applications,  as  well  as  the  need  to  transfer 
existing  data  to  be  used  by  new  computer  systems,  makes  it  apparent  that  general 
techniques  for  data  base  conversion  are  desirable.  The  goal  of  this  project  is 
to  develop  techniques  for  data  base  conversion  that  are  practical  for  applica¬ 
tion  tc  current  data  management  systems  and  that  are  des  grcd  to  be  easily  used 
by  data  base  users. 

The  difficulties  in  converting  e  data  base  from  one  data  management  system  (DMS) 
to  another  arise  from  the  fact  that  data  base  structures  are  system  and  applica¬ 
tion  dependent.  As  a  result,  tha  DMS  imposes  constraints  on  the  form  of  th'-> 
data  base.  These  constraints  are  of  three  types:  (1)  logical-level  constraints, 
such  as  level  of  hierarchies,  size  and  number  of  fields,  and  data  types; 

(2)  storage-level  constraints,  such  as  inversion  and  access  path  of  files  and 
file  indexing  organization;  and  (3)  physical- level  constraints,  such  as  physical 
devi  *s  used  and  block/record  structures.  These  levels  were  described  in  reports 
issued  for  the  1972-73  contract  year. 

The  conventional  method  of  converting  data  bases  for  new  applications  is  to  write 
a  special-purpose  conversion  program  for  each  data  base.  Another  possible 
approach  is  to  define  data  description  languages  for  all  three  levels  mentioned 
above,  then  specify  in  these  languages. the  source  and  target  data  bases,  as  well 
as  conversion  statements  between  them.  Since  this  approach  involves  all  three 
levels,  it  requires  complex  ana  detailed  data  description  languages,  which  are 
difficult  to  learn  and  to  use.  It  also  requires  that  data  be  converted  from  the 
source  physical  environment  to  the  corresponding  target  physical  environment, 
which  further  complicates  any  possible  implementation. 


This  approach  has  been  discussed  by  several  researchers.  See  D.  P.  Smith,  "An 
Approach  to  Data  Description  and  Conversion"  (Univ.  of  Penn.  Ph.D  dissertation, 
1971);  J.  A.  Ramirez,  "Automatic  Generation  of  Data  Conversion  Programs  Using 
a  Data  Description  Language"  (Univ.  of  Penn.  Ph.D.  dissertation,  1972);  and 
three  papers  published  in  Proceedings  of  the  1972  ACM  S IGF I PET  Workshop  (New 
York:  Association  for  Computing  Machinery):  "An  Approach  to  Stored  Data 
Definition  and  Translation"  (Stored  Data  Definition  and  Translation  Task  Group, 
pp.  13-56) ,  "A  Method  for  Data  Translation  Using  the  stored  Data  Definition  ar J 
Translation  Task  Group  Languages"  (D.  P.  Smith,  pp.  107-124),  and  "A  Develop¬ 
mental  Model  for  Data  Translation"  (J.  P.  Fry,  R.  L.  Frank,  and  E.  A.  Hershey, 
III,  pp.  77-106). 
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The  approach  being  taken  by  this  project  is  based  on  the  idea  that  the  deta 
conversion  process  can  depend  mainly  on  conversion  at  the  logical  level. 
Conversion  at  this  level  can  be  achieved  by  using  existing  query  and  generate 
capabilities  of  DMSs  to  move  data  from  their  physical  representation  to  the 
logical  leve3  end  vice  versa.  The  tasks  required  in  the  data  conversion  process 
are  diagrammed  'n  Figure  4-1.  First,  the  source  data  base  is  retrieved  using 
the  query  capabilities  of  the  source  UiS  and  reformatted  into  a  standard  form. 
Then,  the  data  translation  process  takas  place,  and  a  target  data  base  in  the 
standard  form  is  produced  and  reformatted  into  a  data  format  acceptable  to  the 
generate  „  pability  of  the  target  DMS.  Finally,  the  target  data  base  is  gen¬ 
erated  with  the  generate  capability  of  the  targe*.  MS. 


Figure  4-1.  Data  Flow  in  the  Data  Conversion  Process 
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4.2  PROGRESS  AND  PRESENT  STATUS 

The  first  task  of  this  project  during  the  present  contract  year  was  to  explore 
the  possibilities  of  automatically  generating  target  data  descriptions  from 
source  data  descriptions.  The  file  definition  languages  (FDLs)  of  several  DMSs 
were  studied  in  an  attempt  to  isolate  the  restrictions  these  systems  impose  on 
data  base  structures.  After  a  thorough  analysis  of  these  FDLs,  it  was  concluded 
that  automatic  generation  of  a  target  description  from  the  source  description  is 
either: 

1)  Trivial — when  the  restrictions  on  the  complexity  of  data  base  structures 
of  the  target  system  are  less  constraining  than  the  restrictions  of  the 
source  system  (for  example,  going  from  a  system  that  allows  cne  level 
of  hierarchy  into  a  system  that  allows  nine  levels  of  hierarchy) >  or 

2)  Impossible  to  predict  in  the  general  case ,  because  the  target 
description  is  semantically  dependent  on  the  intended  use  of  the 
data  base  and  on  time,  space,  and  cost  considerations. 

Consequently,  we  concluded  ♦-hat  it  would  not  be  fruitful  to  continue  in  this 
direction,  and  we  decided  to  direct  the  main  effort  of  the  project  at  the 
development  of  processes  that  actually  convert  data,  rather  than  at  the  auto¬ 
matic  generation  of  target  data  descriptions.  In  the  context  of  developing 
the  details  of  the  data  conversion  process,  the  following  tasks  were  performed: 

1)  A  detailed  study  was  made  of  the  different  data  description  languages 
and  data  conversion  approaches  described  in  the  literature.  We 
concluded  chat  these  approaches  advocate  the  use  of  details  of 

all  three  levels  of  data  description  and  are  therefore  eguallj 
difficult  to  implement  and  use.  This  led  us  to  the  current 
approach,  which  depends  mainly  on  the  logical  level  of 
data. 

2)  A  methodology  for  the  data  conversion  process  was  developed.  It 
involves  the  development  of  source  and  target  reformatters  and  a 
logical  data  translator.  These  processes  are  driven  by  statements 
written  in  three  languages,  which  are  dependent  mainly  on  logical 
characteristics  of  tne  data  to  be  converted.  These  languages  are 
the  Common  Data  Description  Language  (CDDL) ,  the  Common  Data  Trans¬ 
lation  Language  (CDTL) ,  and  the  Common  Data  Format  Language  (CDFL) . 


See  E.  H.  Sibley  and  R.  W.  Taylor,  "A  Data  Definition  and  Mapping  Language," 
Comm.  ACM,  December  1973  (pp.  750-759);  R.  W.  Taylor,  "Generalized  Data  Base 
Managemc  it  System  Data  Structures  and  their  Mapping  to  Physical  Storage" 
(Univ.  of  Michigan  Ph.D.  dissertation,  1971);  D.  P.  Smith,  "An  Approach  to 
Data  Desorption  and  Conversion"  'loc.  cit.);  and  CODASYL  Systems  Committee, 
"Feature  Analyjis  of  Generalized  Data  Base  Management  Systems"  (New  York: 
Association  for  Computing  Machinery,  1971). 
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3)  A  preliminary  version  of  CDDL  was  developed.  This  was  accomplished 
after  an  analysis  of  existing  DDLs  (especially  those  of  CODASYL  and 
Smith)  and  selecting  the  subset  that  is  relevant  to  our  approach. 

The  syntax  of  the  current  version  of  CDDL  is  given  in  Section  4.2.1. 

The  syntax  of  CDDL  represents  a  logical  view  of  hierarchical  data 
structures.  Because  it  contains  no  storage-level  or  physical- level 
requirements,  it  is  a  simple  language  to  be  specified  by  a  user.  At 
the  rame  time,  the  language  is  flexible  enough  to  represent  a  given 
data  management  system's  limitations  (such  as  hierarchy  levels)  for 
any  data  base  organized  in  this  DMS.  The  statements  in  CDDL,  together 
with  statements  in  CDTL,  supply  all  the  necessary  information  for  the 
data  conversion  process.  As  a  consequence,  CDDL  might  change  accord¬ 
ing  to  the  requirements  of  CDTL. 

4)  The  functions  necessary  for  the  Common  Data  Translation  Language  (CDTL) 
were  developed.  These  functions  are  represented  in  terms  of  conversion 
statements  between  fields  of  the  source  data  description  and  the  target 
data  description.  A  conversion  statement  consists  of  an  assocatlon 

of  one  source  field  (or  more)  to  a  target  field,  plus  an  algorithm  for 
the  conversion  of  data  (such  as  truncation,  concatenation,  or  data¬ 
type  transformation) .  Types  of  conversion  statements  were  identified 
and  form  the  basis  of  CDTL.  A  summary  of  these  types  is  given  in 
Section  4.2.2. 

4.2.1  Common  Data  Description  Language  (CDDL)  Specification 

CDDL  is  a  simple  language.  A  file  statement  consists  of  group  and  field 
statements.  A  group  statement,  in  turn,  consists  of  field  statements  and, 
possibly,  additional  group  *» v aments ,  thus  establishing  a  hierarchical 
structure.  There  are  several  types  for  fields,  and  a  repetition  number  for 
groups  puts  an  upper  limit  on  the  number  of  values  in  a  group  instance. 

datadescription  =  filestatement  { fieldstatement  |  groups tatement) 
filestatement  *  FILE  <filename  :  recordlength  :  filegroupname> 
fieldstatement  =  FIELD  <fieldname  :  da t a type > 

groupstatement  =  GROUP  <groupn«ne  :  grouplength  :  ( <FIELD  :  fieldname>  | 

<GROUP  :  groupname  :  repetition  }> 

filename  =*  {character} 
recordlength  «  integer  |  NOLIMIT 
filegroupname  =  groupname 
fieldname  «  {character} 
datatype  ■  string  j  number  j  picture 
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groupname  =  {character} 
grouplength  =  integer  j  NOLIMIT 
repetition  =  integer  |  NOLIMIT 
string  =  C  (stringlength) 

number  =  I  (int)  |  FP  (fixedparameters)  |  FL  (floatparamrters) 

picture  =*  PICT  pictspec 

integer  =  (digit) 

stringlength  =  integer 

int  =  integer 

fixtedparameters  =  int  .  int 
floatparameters  =  fixedparameters  E  int 
pictspec  =  '{#  |  @  |  *  |  character} • 
digit  =  l|2|3|4|b'6|7|b|9|j( 

character  *=  a|b|c|d|e|f|g|h|i|j|k|l|m|n|o|p|q|r|s|t|u|v|w|x|y|z| 
a|b|c|d|e|f|g|h|i|j|k!l|m|n|o|p|q|r|s|t|u|v|w|x|yjz| 
(2(|1|2|3|4|5|6|7|8|9M;|.|?|:| 

(hi  l$H+H/l»l%H 

|  "*  I  "@  |  "<  |  ">  I  ""  J "  s  I"# 

4.2.2  Types  of  Data  Conversion  Functions 

After  analyzing  file  description  languages  of  existing  DMSs,  we  concluded  that 
the  following  conversion  functions  would  be  most  useful: 

1)  Instance  mapping — represents  a  mapping  of  instances  of  a  field  of  a 
repeating  group  (RG)  into  a  field  of  a  higher  level  RG. 

2)  Inverse  instance  mapping — represents  a  combination  of  multiple  values 
of  fields  of  the  same  RG  level  into  an  RG  instance  of  a  lower  level. 

3)  Operator  mapping — allows  a  set  of  values  in  an  RG  instance  to  be 
combined  by  some  operation  (e.g.,  Average)  into  one  value  of  a  field 
in  a  higher-level  RG. 

4)  Simple  field  mapping — allows  for  an  association  of  source  and  target 
fields  according  to  a  given  algorithm  (e.g. ,  truncation) . 

Field  value  repetition — necessary  when  a  repetition  of  a  field  value 
through  values  of  a  lower- level  RG  is  required. 


5) 
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6)  Level  creation — can  be  used  to  create  a  target  RG  level  according  to 
a  criterion  cn  a  source  RG. 

7)  Field  partition — allows  the  splitting  of  an  RG  into  two  or  more  RGs 
(e.g. ,  an  emplo  ->e  PG  into  an  exempt  employee  RG  and  a  non-exempt 
employee  RG) . 

8)  Field  conjunction — the  inverse  function  to  field  partition. 

9)  Field  concatenation — necessary  when  a  target  field  is  made  up  by 
concatenating  source  fields  (or  portions  of  them) . 

10)  Intermediate  level  elimination — can  be  used  for  the  elimination  of  an 
RG  level  present  in  the  source  but  unnecessary  in  the  target  data 
base. 

11)  Inversion — necessary  when  an  alternate  view  of  the  data  base  is 
required  (e.g.,  a  department-employee  data  base  needs  to  be  re¬ 
organized  as  an  employee-department  data  base) . 

4.3  PLANS 

During  the  remainder  of  the  contract  year,  the  design  and  specification  of  CDTL 
and  the  design  of  the  logical  data  translator  will  be  completed.  Six  tasks  are 
involved • 


CDTL  Design  and  Specification 

1)  An  analysis  of  the  semantic  implications  of  data  base  conversions  to 
determine  the  desirable  combination  of  functions  in  CDTL.  This  will 
eliminate  the  possibility  of  a  data  conversion  requirement  that  is 
semantically  impossible. 

2)  The  development  of  a  specification  of  CDTL  according  to  the  functions 
required. 

3)  Design  of  the  standard  data  format  to  be  used  b-  ^he  data  translator 
and  the  reformatters. 

Logical  Data  Trims lator  Design 

4)  Design  of  a  process  that  will  perform  a  lexical  and  semantic  analysis 
of  the  source  and  target  data  base  descriptions  in  CDDL  and  of  the 
translation  statements  in  CDTL  and  produce  internal  tables  that 
represent  the  data  translation  operations  to  be  performed. 
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5)  Design  of  read/write  modules  that  can  input  data  and  generate  output 
data. 

6)  Design  of  control  modules  that  restructure  source  data  instances  into 
target  data  instances  by  utilizing  the  read/write  modules  and  the 
tables  produced  by  the  lexical  and  semantic  analysis  process. 

By  the  end  of  this  contract  year,  the  project  will  be  ready  to  begin 
implementation  of  the  conversion  functions  according  to  the  CDDL  and  CDTL 
statements.  In  parallel,  we  plan  to  study  the  functions  necessary  for  CDFL 
and  develop  it. 

4.4  STAFF 


A.  Shoshani,  Project  Leader 
K.  J.  Fogt 


